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2B  DISCUSSING,  USING  AND  RECOGNIZING  PLANS 


2B.1  OVERVIEW 

This  project,  also  known  as  the  Natural  Language  Planning  project,  is  a  joint  project  of  a 
research  group  at  SUNY  at  Buffalo  (UB),  led  by  Dr.  Smart  C.  Shapiro,  and  a  research  group  at 
U.  Mass,  led  by  Dr.  Beverly  Woolf.  The  project  is  devoted  to  the  investigation  of  a  knowl^ge 
representation  design  compatible  with  the  intensional  knowledge  representation  theory  previ¬ 
ously  developed  by  Dr.  Shapiro  and  his  co-workers  and  capable  of  providing  a  natural  language 
interacting  system  with  the  ability  to  discuss,  use,  and  recognize  plans.  The  project  officially 
got  underway  on  August  13,  1987.  These  last  few  months  of  1987  were  devoted  to  reviewing 
the  extensive  literature  on  planning,  working  on  the  software  to  be  used  jointly  by  the  UB  and 
U.  Mass,  researchers,  developing  and  experimenting  with  an  initial  design  of  a  representation  of 
plans,  and  investigating  the  possible  integration  of  GRAPPLE,  the  U.  Mass,  plan  formalism 
with  SNePS,  the  UB  knowledge  represenution  system. 
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2BJ,  OBJECTIVES 

The  objectives  of  this  project  are  to: 

(1)  design  a  representation  for  plans  and  rules  for  reasoning  about  plans  within  an  esta¬ 
blished  knowledge  representation/reasoning  (KRR)  system;  enhance  the  KRR  system  so 
that  it  can  act  according  to  such  plans; 

(2)  write  a  grammar  to  direct  an  esublished  natural  language  processing  (NLP)  system  to 
analyze  English  sentences  about  plans  and  represent  the  semantic/conceptual  content  of 
the  sentences  in  the  representation  designed  for  objective  (l);  the  resulting  NLP  system 
should  be  able  to:  accept  sentences  describing  plans,  and  add  the  plans  to  its  “plan 
library”;  answer  questions  about  the  plans  in  its  plan  library;  accept  sentences  describ¬ 
ing  the  actions  of  others,  and  recognize  when  those  actions  constitute  the  carrying  out 
of  a  plan  in  its  plan  library. 

The  KRR  system  to  be  used  is  SNePS  (Shapiro  1979),  and  the  NLP  system  to  be  modified 
for  this  purpose  is  CASSEE  (Shapiro  &  Rapaport  1987).  The  UB  group  is  responsible  for 
enhancing  SNePS/CASSIE  according  to  the  objectives  listed  above.  The  U.  Mass,  group  is  respon¬ 
sible  for  testing  the  enhanced  system  in  the  specific  domains  of  the  Blocks  World,  tutoring,  and 
space  launch  narratives.  Communication  and  feedback  between  the  two  groups  will  greatly 
improve  the  work  of  both. 
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2BJ  TECHNICAL  PROGRESS  IN  1987 


2BJ.1  The  Underlying  Knowledge  Representation  Theory 

A  basic  principle  of  our  theory  of  intensional  knowledge  representation,  embodied  in  the 
design  and  use  of  SNePS  as  a  propositional  semantic  network,  is  The  Uniqueness  Principle  - 
that  there  be  a  one-to-one  mapping  between  nodes  of  the  semantic  network  and  concepts  (men¬ 
tal  objects)  about  which  information  may  be  stored  in  the  network.  These  concepts  are  not 
limited  to  objects  in  the  real  world,  but  may  be  various  ways  of  thinking  about  a  single  real 
world  object  (e.g.  the  Morning  Star  vs.  the  Evening  Star  vs.  Venus).  They  may  also  be  abstract 
objects  like  properties,  propositions.  Truth,  Beauty,  fictional  objects,  and  impossible  objects. 
They  may  include  specific  facts  as  well  as  general  facts,  and  even  rules,  which  can  be  believed, 
disbelieved,  or  followed  when  reasoning. 

It  is  a  major  hypothesis  of  the  current  project  that  plans  are  also  mental  objects  that  can 
be  represented  in  such  a  propositional  semantic  network.  We  can  discuss  plans  with  each  other, 
reason  about  them,  formulate  them,  follow  them,  and  recognize  when  others  seem  to  be  follow¬ 
ing  them.  An  AI  system,  using  SNePS  as  its  belief  structure  should  also  be  able  to  do  these 
things. 

Plans,  being  structures  of  actions,  states,  and  other  plans,  resemble  reasoning  rules,  which 
are  structures  of  beliefs.  However,  they  are  difiFerent  in  important  ways:  reasoning  rules  are 
rules  for  forming  new  beliefs  based  on  old  beliefs,  plans  are  rules  for  acting;  a  belief,  once 
formed,  need  not  be  formed  again,  an  action  may  need  to  be  performed  multiple  times;  the  tem¬ 
poral  order  of  assessing  old  beliefs  and  forming  new  beliefs  is  very  flexible,  the  temporal  order 
of  performing  actions  is  crucial  to  the  correct  carrying  out  of  a  plan.  The  representation  of  rea¬ 
soning  rules  in  SNePS,  and  the  algorithm  for  reasoning  according  to  them,  implemented  in  SNIP 
-  the  SNePS  Inference  Package,  have  been  carefully  designed  to  make  reasoning  flexible  and 
efficient.  In  this  project,  we  are  undertaking  a  sunilar  design  of  the  represenution  and  use  of 
plans.  The  various  objectives,  outlined  in  Section  2B2,  form  important  constraints  on  this 
aesign. 

2B3,2  A  First  Representation  of  Acts  and  Plans 

In  1987,  we  have  begun  experimenting  with  an  iniiiai  design  of  plans  and  of  an  acting 
system  that  uses  them.  Our  representation  of  acts  is  based  on  that  of  Almeida  (1987).  He  dis¬ 
tinguishes  the  nodes  for  an  act,  the  event  of  that  act’s  being  performed  by  a  particular  actor  at 
a  particular  time,  and  the  proposition  of  that  event’s  having  occurred.  The  benefit  of  this  dis¬ 
tinction  is  that  the  same  node  may  be  used  to  represent  the  same  act  (as  required  by  the 
Uniqueness  Principle)  no  matter  who  performs  it  and  no  matter  when  performed  Our  initial 
representation  of  an  act  is  a  node  with  2m  ACTION  arc  to  a  node  that  represents  the  action,  and 
OBJECTl,  _,  OBJECTn  arcs  to  the  required  objects  of  the  action,  as  shown  in  Fig.  1.  For  exam¬ 
ple,  the  SNePSUL  command  for  building  a  node  representing  the  act  of  saying  'TOO"  is: 

(buUd  action  say  objectl  FOO) 


Figure  1.  Mx  represents  the  act  of  performing  <3ctian>  on  the<object>s. 
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It  is  necessary  to  distinguish  between  primitive  and  complex  (non-primitive)  actions.  A 
primitive  action  is  one  that  the  system  is  capable  of  performing.  The  system  is  not  capable  of 
performing  a  complex  action,  so  in  order  to  carry  out  a  complex  action,  the  system  must  decom¬ 
pose  it  into  a  structure  of  other  actions,  which,  ultimately,  must  be  primitive.  We  assert  that 
an  action  is  primitive  using  the  standard  SNePS  MEMBER -CLASS  case  frame  shown  in  Fig.  2. 
To  assert  that  “saying”  is  primitive,  we  execute  the  SNePSUL  command: 

(build  member  say  class  primitive) 

Complex  actions  are  those  actions  not  asserted  to  be  primitive. 


<Bctian>  PRIMITIVE 


Figure  2.  Mx  represents  the  proposition  that  <action>  is  a  primitive  action. 


A  complex  action  is  performed  by  decomposing  it  into  a  structure  of  simpler  actions, 
which  constitute  a  plan  for  carrying  out  the  complex  action.  Our  representation  presupposes 
the  hypothesis  that  this  decomposition  must  take  the  objects  of  the  action  into  account,  but 
needn’t  take  the  actor  into  account.  Le.,  the  decomposition  relation  holds  between  two  acts,  and 
is  represented  as  shown  in  Fig.  3.  The  act  at  the  end  of  the  PLAN  arc  must  be  more  decom¬ 
posed  than  the  act  at  the  end  of  the  A^T  arc.  E.g..  it  may  be  the  case  that  actl  is  complex, 
while  act2  is  primitive.  (We  will  say  that  an  act  is  primitive  or  complex  Just  when  its  action 
is.)  We  can  use  a  SNePS  rule  to  assert  that  the  plan  for  “asserting”  anything  is  to  “say”  it  by 
executing: 


(build  avb  $x 

act  (build  action  assert  object  1  *x) 
plan  (build  action  say  objectl  ^x)) 


PigTire  3.  Mx  represents  the  proposition  that  <act2>  oonstitiites  a  plan 
for  carrying:  out  <act/>.  <act2>  must  be  ntore  decomposed  than  <actf>. 
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This  kind  of  plan  is  one  for  carrying  out  a  complex  action.  Another  lund  of  plan  is  one 
for  achieving  some  state.  The  representation  of  that  kind  of  plan  is  shown  in  Fig.  4.  For  the 
representations  of  propositions,  we  use  the  constructs  shown  in  this  report  and  those  shown  in 
(Shapiro  &  Rapaport  1987). 


<proposition>  <act> 

Figure  4.  Mx  represents  the  proposition  that  <act>constitutes  a  plan 
for  achieving  a  state  in  which  <proposition>  is  true. 

The  system  may  already  know  a  plan  for  a  goal  or  complex  act.  However,  if  it  does  not 
already  have  such  a  plan,  it  may  try  to  produce  it  by  reasoning  about  the  complex  act  or  goal, 
effects  of  acts,  etc.  Such  reasoning  constitutes  the  planning  activity.  Effects  of  an  act  may  be 
asserted  into  the  SNePS  network  just  like  any  other  beliefs.  Initially,  we  use  the  representation 
shown  in  Fig.  5,  although  it  is  certainly  true  that  the  effects  of  an  act  may  depend  on  the  actor, 
so  this  representation  is  too  simplistic.  Another  simplistic  assumption  we  arc  making  initially 
is  that  all  effects  of  a  performed  act  occur.  Examples  of  effect  assertions  will  be  shown  in  the 
next  section. 

Initially,  we  are  treating  preconditions  as  conditions  on  plans.  Examples  will  be  shown 
in  the  next  section.  This  is  clearly  inadequate,  presupposing  that  planning  is  done  “on  the  fly” 
as  the  acts  are  performed,  and  in  the  upcoming  year,  we  will  investigate  advanced  planning, 


<act>  <proposition> 


Figure  5.  Mx  represents  the  proposition  that  the  effect  of  performing  the<act> 
is  that  the<proposition>  becomes  true. 
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the  use  of  states  as  preconditions,  and  reasoning  about  effects  as  a  method  of  planning. 

2B,3  J  An  Acting  System 

The  initial  experimental  acting  system  is  composed  of  an  acting  executive  (called  “snact”) 
and  a  queue  of  acts  to  be  carried  out.  The  top-level  algorithm  is: 

SNACItQUEUE) 

REPEAT 

remove  FIRST- ACT  from  QUEUE 
IF  FIRST- ACT  is  primitive  THEN 
DO  FIRST- ACT 
INFER  effects  of  FTRST-ACT, 

and  schedule  the  believing  of  them 
ELSE  {  FTRST-ACT  is  complex  } 

DEDUCE  a  plan  for  carrying  out  FIRST-ACT, 
and  add  it  to  QUEUE 
ENTilF 

UNTIL  QUEUE  is  empty. 


Primitive  actions  may  be  supplied  to  the  system  by  programming  them  in  Lisp.  One  of 
the  simplest  primitive  action  we  have  wntten  is  “say”  -  the  action  of  printing  something  on 
the  output  device,  used  as  an  example  above.  The  code  defining  say  is: 

(defun  say  (n) 

(format  t  "”&’A~9!3" 

(first-atom  (pathget  n  ’object  1)))) 

Sev<-ral  things  may  be  noted  about  this  definition:  the  action  takes  an  act  node  with  itself  as 
action  as  its  argument  so  the  function  call  of  acts  is  consistent  regardless  of  the  number  of 
arguments  the  action  takes;  pathget  takes  a  node  and  a  path  of  arc  label,  and  returns  the  list  of 
nodes  at  the  end  of  the  given  path  from  the  given  node;  first-atom  returns  the  first  node  in  the 
list  under  the  assumption  that  each  argument  of  a  say  act  will  be  a  single  node.  In  the 
remainder  of  this  report,  we  will  refrain  from  showing  actual  Lisp  code  of  primitive  acts,  but 
will  describe  them  in  the  format; 

(say  <node>)  Primitive-action 
"Prints  <node>  on  the  output  device" 

The  arguments  will  be  shown  in  a  way  that  indi''ates  their  types. 

So  that  we  can  explain  the  circumstances  under  which  nothing  need  be  done,  we  have 
defined  the  noop  action: 

(noop)  Primitive-action 
"Does  nothing." 


TTie  snact  algorithm  includes  the  scheduling  of  believing  that  the  effects  of  acts  have  hap¬ 
pened.  This  makes  use  of  the  primitive  believe  action: 
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(believe  <proposition>)  Primitive-action 
"Causes  the  system  to  believe  <  proposition  >,  and  removes  from  the 
system  any  belief  in  < proposition  >’s  negation." 

Notice  that  this  is  an  internal,  mental  action. 

Another  primitive  action  we  have  written  is  an  important  paradigm  for  our  design  and 
experimentation  with  representations  of  plans.  It  is  the  primitive  action  of  performing  two 
sub-acts  in  sequence: 


(snsequence  <actl>  <act2>)  Primitive-action 
"Puts  <actl>  followed  by  <act2>  on  the  front  of  the  act  queue" 

Since  either  or  both  sub-acts  can  themselves  be  snsequence  acts,  we  have  a  general  structure  for 
plans  of  sequential  actions.  Using  this  action  as  a  model,  we  next  will  experiment  with  the 
design  of  other  control  structure  actions. 

Notice  that  snsequence  is  also  a  mental  action,  and  is  an  “intention  forming"  action  in  the 
sense  that  putting  an  act  on  the  act  queue  models  a  cognitive  agent’s  forming  the  intention  of 
performing  that  act. 

One  may  also  intend  to  achieve  a  goal.  The  goal,  itself,  being  propositional,  should  not  be 
put  on  the  act  queue.  For  that  purpose,  we  have  the  achieve  action: 

(achieve  <  proposition  >)  Primitive-action 
'Tinds  or  infers  a  plan  for  achieving  a  state  in  which  the 
proposition  holds,  and  schedules  the  carrying  out  of  that  plan." 

Since  more  than  one  plan  may  be  found,  we  have  defined  a  choose-plan  operation.  Initially, 
choose-plan  chooses  a  plan  at  random,  but  we  will  investigate  the  creation  of  rules  and  plans 
for  choosing  plans  more  intelligently. 

2B3A  The  Blocks  World 

The  first  domain  for  our  acting  system  is  the  well-known  blocks  world. 

The  predicates  used  in  the  Blocks  World  are: 

(clear  <block>)  Predicate 
"<block>  has  no  blocks  on  top  of  it." 

(ontable  <block>)  Predicate 
"<block>  is  sitting  directly  on  the  table." 

(holding  <block>)  Predicate 
"The  robot  is  holding  <block>." 

The  Blocks  World  relations  are: 

(on  <blockl>  <block2>)  Relation 
"<blockl>  is  directly  on  top  of  <block2>." 

The  primitive  Blocks  World  actions  are: 
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(pickup  <block>)  Primitive-action 
"The  robot  picks  up  <block>  from  the  table." 

(putdown  <block>)  Primitive-action 
"The  robot  puts  <block>  down  on  the  table." 

(stack  <blockl>  <block2>)  Primitive-action 
"The  robot  stacks  <blockl>  on  top  of  <block2>." 

(unstack  <blockl>  <biock2>)  Primitive-action 
'The  robot  picks  <blockl>  up  from  on  top  of  <block2>." 

Since  we  do  not  have  an  actual  robot  arm,  these  actions  are  simulated  by  printing  appropriate 
messages  to  the  output  device. 

The  effects  of  the  primitive  actions  are  asserted  by  the  following  SNePSUL  commands, 
where  the  preceding  comments  explain  the  statements. 

After  picking  up  a  block,  the  block  is  not  clear. 

(build  avb  Sblock 

act  (build  action  pickup  objectl  *block)  =  PICKUP-BLOCK 
effect  (build  min  0  max  0 

arg  (build  property  clear  object  *block)  »  BLOCK -IS-CLEAR) 

=  BLOCK-IS-NOT-CLEAR) 

s;  After  picking  up  a  block,  the  block  is  not  on  the  table. 

(build  avb  *block 

act  *PICKUP-BLOCK 
effect  (build  min  0  max  0 

arg  (build  property  ontable  object  •block)  -  BLOCK-ON-TABLE)) 

;;;  After  picking  up  a  block,  the  robot  is  bolding  the  block. 

(build  avb  *block 

act  •PICKUP-BLOCK 

effect  (buUd  property  holding  object  •block)  -  HOLDING-BLOCK) 

s;  After  putting  down  a  block,  the  robot  is  not  holding  the  block. 

(build  avb  •block 

act  (build  action  putdown  objectl  •block)  -  PUTDOWN-BLOCK 

effect  (build  min  0  max  0  arg  •HOLDING-BLOCK)  -  NOT-HOLDING-BLOCK) 

;;;  After  putting  down  a  block,  the  block  is  clear. 

(build  avb  •block 

act  •PUTDOWN-BLOCK 
effect  •BLOCK-IS-CLEAR) 

s;  After  putting  down  a  block,  the  block  is  on  the  table. 

(build  avb  •block 

act  •PUTDOWN-BLOCK 
effect  •BLOCK-ON-TABLE) 

;;;  After  stacking  one  block  on  another,  the  robot  is  no  longer  holding  it. 

(build  avb  (•block  $other-block) 
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act  (build  action  stack  objectl  *block  object2  *othcr-block) 

-  STACK-ONE-ON-OTHER 
effect  *NOT-HOLDING-BLOCK) 

;;;  After  stacking  one  block  on  another,  the  latter  is  no  longer  clear. 

(build  avb  (*block  *other-block) 
act  ‘STACX-ONE-ON-OTHER 
effect  (build  min  0  max  0 

arg  (build  property  clear  object  ♦other-block) 

-  OTHER-IS-CLEAR)) 

s;  After  stacking  one  block  on  another,  the  former  is  on  the  latter. 

(build  avb  (*block  ♦other-block) 
act  ♦STACK-ONE-ON-OTHER 

effect  (build  rel  on  argl  ♦block  arg2  ♦other-block)  «  ONE-ON  -OTHER) 

s;  After  stacking  one  block  on  another,  the  former  is  clear. 

(build  avb  (♦block  ♦other-block) 
act  ♦STACK-ONE-ON-OTHER 
effect  ♦BLOCK-IS-CLEAR) 

;;;  After  unstacking  one  block  from  another,  it  is  no  longer  clear. 

(build  avb  (♦block  ♦other-block) 

act  (build  action  unstack  objectl  ♦block  object2  ♦other-block) 

-  UNSTACK-ONE-FROM-OTHER 
effect  ♦BLOCK-IS-NOT-CLEAR) 

Z  After  unstacking  one  block  from  another,  it  is  no  longer  on  the  other, 
(build  avb  (♦block  *other-block) 

act  ♦UNSTACK-ONE-FROM-OTHER 

effect  (build  min  0  max  0  arg  ♦ONE-ON-OTHER)) 

Z  After  unstacking  one  block  from  another,  the  latter  is  clear. 

(build  avb  (♦block  ♦other-block) 

act  ♦UNSTACK-ONE-FROM-OTHER 
effect  *OTHER-IS-CLEAR) 

;;;  After  unstacking  one  block  from  tmother,  the  robot  is  bolding  the  former, 
(build  avb  (♦block  ♦other-block) 

act  ♦UNSTACK-ONE-FROM-OTHER 
effect  ♦HOLDING-BLOCK) 


Next,  we  describe  some  Blocks  World  plans. 

Plans  for  achieving  the  holding  of  a  block 

Z  If  the  robot  is  not  already  holding  some  block,  Bl,  and  B1  is  on  some 
;;;  other  block,  B2,  and  Bl  is  clear,  then  unstacking  Bl  from  B2  is  a  plan 
;;;  for  holding  Bl. 

(buUd  avb  (♦block  ♦other-block) 

&ant  ((build  I-/- 1  arg  ♦HOLDING-BLOCK) 

♦ONE-ON-OTHER 

♦BLOCK-IS-CLEAR) 
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cq  (buUd  plan  ‘UNSTACK-ONE-FROM-OTHER 
goal  ‘HOLDING-BLOCK)) 


;;;  If  a  block  is  on  the  table  and  clear,  then  picking  it  up  is  a  plan  for 
;;;  holding  it. 

(build  avb  *block 

&ant  (‘BLOCK-ON-TABLE 
‘BLOCK-IS-CLEAR) 
cq  (build  plan ‘PICKUP-BLOCK 

goal  ‘HOLDING-BLOCK)) 

;;;;  If  the  robot  is  already  holding  a  block,  then  doing  nothing  is  a  plan 
;s:  for  holding  it. 

(build  avb  ‘block 

ant  ‘HOLDING-BLOCK 

cq  (build  plan  (build  action  noop)  «  DO-NOTHING 
goal  ‘HOLDING-BLOCK)) 

Plans  for  getting  a  block  on  the  table 

-  If  some  block  is  not  on  the  table,  then  the  robot  can  get  it  there  by 
;;;  first  holding  the  block,  then  putting  it  down. 

(buUd  avb  ‘block 

ant  (build  I-/- 1  arg  ‘BLOCK-ON-TABLE) 
cq  (buUd  plan  (buUd  action  snsequence 

object  1  (build  action  achieve 

objectl  ‘HOLDING-BLOCK) 
-  HOLD-BLOCK 
object2  ‘PUTDOWN-BLOCK) 
goal  ‘BLOCK-ON-TABLE)) 

;;;  If  a  block  is  already  on  the  table,  then  doing  nothing  is  a  plan  for 
;;;  putting  it  there. 

(buUd  avb  ‘block 

ant  ‘BLOCK-ON-TABLE 
cq  (build  plan  ‘DO-NOTHING 

goal  *BLOCK-ON-TABLE)) 

Plans  for  getting  one  block  on  top  of  another 

;;;  If  block  B1  is  already  on  top  of  block  B2,  then  one  need  do  nothing  to 
Z  get  it  there. 

(buUd  avb  (‘block  ‘other-block) 
ant  ‘ONE-ON-OTHER 
cq  (build  plan  ‘DO-NOTHING 

goal  ‘ONE-ON-OTHER)) 

If  block  B1  is  not  already  on  top  of  block  B2,  then  to  put  it  there, 

;;;  first  clear  B2,  then  hold  Bl,  then  stack  B1  on  top  of  B2. 

(build  avb  (‘block  ‘other-block) 

ant  (build  I-/- 1  arg  *ONE-ON-OTHER) 
cq  (build  plan  (buUd  action  snsequence 

objectl  (build  action  achieve 
2B-16 


objectl  *OTHER-IS-CLEAR) 
object2  (build  action  snsequence 

objectl  *HOLD-BLOCK 

object2  ♦STACK-ONE-ON-OTHER)) 

goal  *ONE-ON-OTHER)) 

Plans  for  clearing  a  block 

;;;  If  a  block  is  already  clear,  nothing  needs  to  be  done  to  clear  it. 

(build  avb  ♦block 

ant  *BLOCK-IS-CLEAR 
cq  (build  plan  ♦DO-NOTHING 

goal  *BLOCK-IS-CLEAR)) 

;;;  If  block  B1  is  on  top  of  block  B2,  then  to  clear  B2, 
s;  first  clear  Bl,  then  put  B1  on  the  table. 

(build  avb  (♦block  ♦other-block) 
ant  ♦ONE-ON-OTHER 
cq  (build  plan  (build  action  snsequence 

objectl  (build  action  achieve 

objectl  ♦BLOCK-IS-CLEAR) 
object2  (build  action  achieve 

objectl  ♦BLOCK-ON-TABLE)) 

goal  ♦OTHER-IS-CLEAR)) 

A  plan  for  building  a  stack  of  three  blocks 

s;  To  build  a  stack  of  three  blocks,  Bl  on  B2  on  B3, 

Z  first  put  B3  on  the  table,  then  put  B2  on  B3,  then  put  Bl  on  B2. 

(build  avb  (♦block  ♦other-block  Sthird-block) 
act  (build  action  make-3-stack 
objectl  •third-block 
object2  ♦block 
object3  ♦other-block) 
plan  (buUd  action  snsequence 

objectl  (build  action  achieve 

objectl  (build  property  ontable 

object  ♦other-block)) 
object2  (buUd  action  snsequence 
objectl 

(build  action  achieve 

objectl  ♦ONE-ON-OTHER) 

object2 

(buUd  action  achieve 

objectl  (build  rel  on 

argl  ♦third -block 
arg2  ♦block))))) 

The  following  is  a  run  of  the  system  after  loading  the  actions,  effects,  plans,  etc.  discussed 
above.  Comments  are  on  lines  beginning  with  The  SNePS  prompt  is  Other  lines  are 
SNePS  output.  The  only  editing  has  been  the  elimination  of  extra  blank  lines  and  some  stray 
trace  prints. 
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First,  we  will  ask  the  system  some  questions  about  the  actions  and  plans  it  knows  about. 

:  What  actions  do  you  know  about? 

*  (find  action-  ?x) 

(make-3-stack  unstack  stack  putdown  pickup  noop  achieve  snsequence) 
exec:  0.26  sec  gc:  0.00  sec 

;  What  are  the  primitive  actions? 

*  (find  (member-  class)  primitive) 

(unstack  stack  putdown  pickup  forget  believe  noop  say  achieve  snsequence) 
exec:  0.06  sec  gc:  0.00  sec 

;  What  complex  actions  do  you  know  about? 

*  ((find  action-  ?x)  -  (find  (member-  class)  primitive)) 

(make-3-stack) 

exec  0.26  sec  gc  0.00  sec 

;  What  are  the  effects  of  picking  up  a  block? 

*  (desc  (find  (art  action)  pickup  effect  ?x)) 

(ml6  (effect  (ml5  (object  (vl))  (property  (holding)))) 

(act  (m8  (objectl  (vl))  (action  (pickup)))) 

(avb  (vl))) 

(ml4  (effect 

(ml3  (arg  (ml2  (object  (vl))  (property  (ontable))))  (max  (0))  (min  (O)))) 

(art  (m8  (objectl  (vl))  (action  (pickup)))) 

(avb  (vl))) 

(mil  (effect 

(mlO  (arg  (m9  (object  (vl))  (property  (clear))))  (max  (0))  (min  (0)))) 

(act  (m8  (objectl  (vl))  (action  (pickup)))) 

(avb  (vl))) 

(dumped) 

exec  0.31  sec  gc  0.00  sec 

;  How  would  you  make  a  stack  of  three  blocks? 

*  (desc  (find  (act  action)  make-3-stack  plan  ?x)) 

(m74  (plan 

(m73  (object2 
(m72  (object2 

(m71  (objectl  (m70  (arg2  (vl))  (argl  (v3))  (rel  (on)))) 

(action  (achieve)))) 

(objectl 

(m69  (objectl  (m27  (arg2  (v2))  (argl  (vl))  (rel  (on)))) 

(action  (achieve)))) 

(action  (snsequence)))) 

(objectl 

(m68  (objectl  (m67  (object  (v2))  (property  (ontable)))) 

(action  (achieve)))) 

(action  (snsequence)))) 
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(m66  (objects  (v2)) 

(objca2  (vl)) 

(objeal  (v3)) 

(action  (make-3-stack)))) 
(avb  (v3)  (v2)  (vl))) 
(dumped) 

exec:  0^8  sec  gc:  0.00  sec 


We  will  now  demonstrate  having  this  system  solve  the  Blocks  World  problem  shown 
6. 


A 


B 


C 


Figure  6.  A  Blacks  World  problem  to  be  solved- 


;;;  First,  we  define  the  initial  state  of  the  problem. 
;  C  is  clear. 

•  (describe  (forbtop  property  clear  object  C)) 

(m75  (object  (C))  (property  (clear))) 

(dumped) 

exec  0.15  sec  gc  0.00  sec 
;  A  is  on  C 

•  (describe  (fo’^btop  rel  on  argl  C  arg2  A)) 

(m76  (arg2  (A))  (argl  (O)  (rel  (on))) 

(dumped) 

exec  0.10  sec  gc  0.00  sec 
;  A  is  on  the  table. 

*  (describe  (forbtop  property  onuble  object  A)) 
(m77  (object  (A))  (property  (ontable))) 

(dumped) 

exec  0.08  sec  gc  0.00  sec 
;  B  is  clear. 

*  (describe  (forbtop  property  clear  object  B)) 

(m78  (object  (B))  (property  (clear))) 
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(dumped) 

exec:  0.10  sec  gc:  0.00  sec 


;  B  is  on  the  table. 

*  (describe  (forbtop  property  on  table  object  B)) 

(m79  (object  (B))  (property  (ontable))) 

(dumped) 

exec:  0.10  sec  gc:  0.00  sec 

;;;  Then  we  ask  the  system  to  build  a  stack. 

;  Make  a  stack  of  A  on  B  on  C. 

*  (snact  (build  action  make-3-stack  objectl  A  object2  B  objects  O) 

Now  doing:  UNSTACK  C  from  A. 

Now  doing:  DISBELIEVE: 

(m75  (object  (C))  (property  (clear))) 

Now  doing:  DISBELIEVE: 

(m76  (arg2  (A))  (argl  (O)  (rel  (on))) 

Now  doing:  BELIEVE: 

(ml04  (object  (A))  (property  (clear))) 

Now  doing:  BELIEVE: 

(m92  (object  (C))  (property  (holding))) 

Now  doing:  PUTDOWN  C  on  table. 

Now  doing:  DISBELIEVE: 

(m92  (object  (C))  (property  (holding))) 

Now  doing:  BELIEVE: 

(m75  (object  (C))  (property  (clear))) 

Now  doing:  BELIEVE: 

(m87  (object  (C))  (property  (ontable))) 

Now  doing:  NOOP 

Now  doing:  PICKUP  B  from  table. 

Now  doing:  DISBELIEVE: 

(m78  (object  (B))  (property  (clear))) 

Now  doing:  DISBELIEVE: 

(m79  (object  (B))  (property  (ontable))) 

Now  doing:  BELIEVE: 

(ml26  (object  (B))  (property  (holding))) 

Now  doing:  STACK  B  on  C. 
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Now  doing:  DISBELIEVE: 

(ml26  (object  (B))  (property  (holding))) 


Now  doing:  DISBELIEVE: 

(m75  (object  (C))  (property  (clear))) 

Now  doing:  BELIEVE: 

(m84  (arg2  (C))  (argl  (B))  (rel  (on))) 

Now  doing:  BELIEVE; 

(m78  (object  (B))  (property  (clear))) 

Now  doing;  NOOP 

Now  doing;  PICKUP  A  from  table. 

Now  doing:  DISBELIEVE; 

(ml  04  (object  (A))  (property  (clear))) 

Now  doing:  DISBELIEVE: 

(m77  (object  (A))  (property  (ontable))) 

Now  doing;  BELIEVE 

(ml63  (object  (A))  (property  (holding))) 

Now  doing:  STACK  A  on  B. 

Now  doing:  DISBELIEVE; 

(ml 63  (object  (A))  (property  (holding))) 

Now  doing:  DISBELIEVE 
(m78  (object  (B))  (property  (clear))) 

Now  doing;  BELIEVE 

(m82  (arg2  (B))  (argl  (A))  (rel  (on))) 

Now  doing:  BELIEVE 

(ml04  (object  (A))  (property  (clear))) 

nil 

exec:  36.00  sec  gc:  7.60  sec 

We  can  now  find  out  the  final  state  by  asking  about  all  the  predicates  and  relations. 


;  What  objects  are  clear? 

♦(describe  (deduce  property  clear  object  %x)) 

(ml04  (object  (A))  (property  (clear))) 

(ml50  (arg  (m75  (object  (C))  (property  (clear))))  (max  (0))  (min  (0))) 
(ml  86  (arg  (m78  (object  (B))  (property  (clear))))  (max  (0))  (min  (0))) 
(dumped) 

exec:  1.83  sec  gc:  0.00  sec 
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:  What  objects  are  on  the  table? 

*(describe  (deduce  property  ontable  object  %x)) 

(m87  (object  (C))  (property  (ontable))) 

(ml  75  (arg  (m77  (object  (A))  (property  (ontable))))  (max  (0))  (min  (0))) 
(ml  39  (arg  (m79  (object  (B))  (property  (ontable))))  (max  (0))  (min  (0))) 
(dumped) 

exec:  1.71  sec  gc:  0.00  sec 

;  What  objects  are  being  held? 

♦(describe  (deduce  property  holding  object  %x)) 

(ml  15  (arg  (m92  (object  (O)  (property  (holding))))  (max  (0))  (min  (0))) 
(ml48  (arg  (ml26  (object  (B))  (property  (holding))))  (max  (0))  (min  (O))) 
(ml 84  (arg  (mi 63  (object  (A))  (property  (holding))))  (max  (0))  (min  (O))) 
(dumped) 

exec:  2.35  sec  gc:  0.00  sec 

;  What  objects  are  on  what  other  objects? 

♦(describe  (deduce  rel  on  argl  %x  arg2  %y)) 

(m82  (arg2  (B))  (argl  (A))  (rel  (on))) 

(m84  (arg2  (C))  (argl  (B))  (rel  (on))) 

(ml02  (arg  (m76  (arg2  (A))  (argl  (C))  (rel  (on))))  (max  (0))  (min  (0))) 
(dumped) 

exec:  1.35  sec  gc:  0.00  sec 


Conclusions 

We  have  designed  and  implemented  a  first  version  of  representing  acts  and  plans  in  SNePs, 
and  of  a  SNePS  acting  component.  We  have  shown  primitive  acts,  their  effects,  plans  for  com¬ 
plex  acts,  and  plans  for  achieving  goals  being  explained  to  the  system  in  short  declarations,  each 
of  which  can  be  seen  to  be  expressible  in  an  easily  understood  English  sentence.  We  have 
shown  the  system  answering  questions  about  its  actions  and  plans.  We  have  shown  the  system 
using  its  plans  to  carry  out  a  Blocks  World  problem. 

Our  representation  distinguishes  actions,  acts,  propositions,  decomposition  (act-based)  plans, 
and  goal-oriented  (state-based)  plans.  It  also  distinguishes  between  primitive  actions/acts  and 
complex  actions/acts.  We  have  actions  that  the  system  performs  on  the  world  (simulated  by 
printing  onto  the  output  device),  such  as  pickup  and  putdown,  and  internal,  mental  actions, 
such  as  believe.  We  also  have  intention-forming  actions,  such  as  achieve,  whose  effects  are  to 
place  acts  on  the  act  queue.  The  intention-forming  action  snsequence  is  a  control-structure 
action  which  permits  us  to  represent  and  carry  out  structured  plans. 

Our  next  tasks  are  clear.  We  will  modify  the  current  CASSIE  grammar  so  that  the  kinds 
of  conversations  shown  in  this  report  can  be  carried  out  in  English  instead  of  in  SNePSUL.  We 
wiU  modify  our  Blocks  World  representation  to  more  closely  mimic  that  of  (Huff  &  Lesser 
1987).  We  wUl  add  additional  primitive  control  structure  actions.  We  will  modify  our 
representation  of  plans  so  that  advanced  planning  can  be  carried  out.  We  will  begin  investigat¬ 
ing  the  use  of  these  plans  to  recognize  when  other  actors  are  carrying  out  plans  the  system 
knows  about.  We  will  begin  representing  plans  from  the  tutoring  domain. 
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2B.4  SOFTWARE  DEVELOPMENT 


SN'ePS-2  is  a  new  design  of  SNePS,  incorporating  advances  in  our  theory  of  intensional 
knowledge  representation  made  since  the  original  SNePS  was  designed  and  implemented. 
SNePS-2  is  being  implemented  in  Common  Lisp  on  Texas  Instruments  Explorers,  Symbolics  Lisp 
Machines,  and  HP  AI  Workstations.  The  current  implementation  of  SNePS-2  includes  the  core 
SNePS  functions  of  explicit  storage  into  and  retrieval  from  SNePS  networks,  and  a  Generalized 
ATN  (Shapiro  1982)  Grammar  interpreter.  This  implementation  has  been  delivered  by  the 
group  at  UB  to  the  group  at  U.  Mass.  Implementation  of  SNIP-2  has  begun. 


2B^  REFERENCES 


(1)  Almeida,  M.  J.  Reasoning  about  the  Temporal  Structure  of  Narratives,  Technical  Report 
No.  87-10,  Department  of  Computer  Science,  SUN"Y  at  Buffalo,  Buffalo,  NY,  1987. 

(2)  Huff,  K.  E.  &  Lesser,  V.  R.  The  GRAPPLE  Plan  Formalism,  COINS  Technical  Report  87- 
08,  Department  of  Computer  and  Information  Science,  U.  Mass.,  Amherst,  MA,  1987. 

(3)  Shapiro,  S.  C.  ITie  SNePS  semantic  network  processing  system.  In  N.  V.  Findler,  ed. 
.Associative  Networks:  The  Representation  and  Use  of  Knowledge  by  Computers. 
Academic  Press,  New  York,  1979,  179-203. 

(4)  Shapiro,  S.  C.  Generalized  augmented  transition  network  grammars  for  generation  from 
semantic  networks.  American  Journal  of  Computational  Linguistics  8,  1  (January- 
March  1982),  12-25. 

(5)  Shapiro,  S.  C.  &  Rapaport.  W.  J.  SNePS  considered  as  a  fully  intensional  propositional 
semantic  network.  In  G.  McCalla  &  N.  Cercone,  eds.  The  Knowledge  Frontier:  Essays 
in  the  Representation  of  Knowledge.  Springer- Verlag,  New  York,  1987,  262-315. 


2B-25 


2B.6  UST  OF  PUBLICATIONS  -  DISCUSSING,  USING  AND  RECOGNIZING  PLANS 


(1)  Srihan.  S.  N..  Rapaport.  W.  J.,  Kumar,  D.  [1987]  "On  Knowledge  Representation  Using 
Semantic  Networks  and  Sansknt",  Technical  Report  87-03,  Department  of  Computer 
Science,  SUXY  at  Buffalo,  Buffalo,  NT. 

(2)  Neal,  I  G.  and  Shapiro,  S.  C^  [l987]  Knowledge-Based  Parsing,  Natural  Language  Pars¬ 
ing  Systems.  L.  Bole,  ed.,  Sprmger-Verlag,  Berlin,  49-92. 

(3)  Yuhan,  k.  H.  and  Shapiro,  S.  C.  [1987]  "Design  of  an  Incremental  Compiler  for  a 
Production -system  .ATN  Machine",  Technical  Report  No.  87-09,  Department  of  Computer 
Science,  SUNY'  at  Buffalo,  38  pp. 

(4)  S.  C.  Shapiro,  R.  J.  Rapaport,  [1987]  S.N'ePS  Considered  as  a  Fully  Intensional  Proposi¬ 
tional  Semantic  Network,  The  Knowledge  Frontier,  N.  Cercone,  G.  McCalla,  eds,. 
Springer- Verlag,  New  York,  263-315. 


2B-27 


2B.7  TRIPS  FUNDED  BY  RADC 

Reliability  and  Maintainability  Symposium,  Philadelphia,  PA,  January  27-29,  1987:  Shapiro. 

N'AIC  Executive  Committee  Meeting,  RPI,  February  2,  1987;  Shapiro. 

Conference  on  AI  Applications  by  IEEE,  Orlando,  Florida,  February  25-27,  1987:  Taie. 

.N'.AIC  Executive  Committee  Meeting,  Syracuse,  NY,  March  27,  1987:  Shapiro. 

NAIC  Spring  Technology  Fair,  RADC,  April  9-10,  1987:  Shapiro,  Srihari,  Geller,  Taie,  Campbell. 

.NAIC  Summer  Workshop,  Minnowbrook  Convention  Center,  NY,  June  29  -  July  2,  1987: 
Shapiro,  Srihari,  Geller,  Hettinger. 

Tenth  International  Joint  Conference  on  Artificial  Intelligence-87  (IJCAI-87),  Milan,  Italy, 
August  23-28,  1987;  Shapiro,  Geller. 

SUNYAB  VMES  -  NLP  Meetings,  Buffalo,  NY,  September  16-17,  1987:  Shapiro. 

Natural  Language  Planning  Workshop,  Minnowbrook  Conference  Center,  NY,  September  20-23, 
1987:  Shapiro. 

.NAIC  Fall  Conference,  Potsdam,  NY,  October  2, 1987:  Srihari,  Kumar,  Ali. 

SNePS  Workshop,  C.  Mass,,  December  1,  1987:  Shapiro. 

NAIC  Executive  Committee  Meeting,  SUNY  at  Buffalo,  NY,  December  3,  1987:  Srihari,  Spahr. 
SUNYAB  VMES  -  NLP  Meetings,  Buffalo,  NY,  December  2-4,  1987:  Shapiro. 

Concurrent  Common  Lisp  Workshop,  Beaverton,  Oregon,  December  9-11,  1987;  Campbell. 
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2B.8  CONFERENCES  ATTENDED,  PAPERS  PRESENTED 


Conference  on  AI  Applications  by  IEEE 
Orlando,  Florida,  February  25-27,  1987 

presented  M.  R.  Taie  Modeling  Connections  for  Circuit  Diagnosis 

1987  Reliability  and  Maintainability  Symposium 
Philadelphia,  PA,  January  27-29,  1987, 

presented  S.  C.  Shapiro  Knowledge  Based  Modeling  of  Circuit  Boards 


Second  UB  Gradtiate-Conference  on  Computer  Science 

Buffalo,  NT.  March  10,  1987 
chairmen  S.  S.  Campbell 

editors  J.  Geller 

K.  E.  Bettinger 


presented  T.  F.  Pawlicki 

J.  M.  Mellor-Crummey 
S.  Wroblewski,  T.  Thomas 

C. -H.  Wang 

D.  Horton,  G.  Hirst 
N.  Wahl,  S.  Miller 

D.  Walters,  G.  Krishnan 
B.  Selman 
S  Svorou 

Y.  Jang,  H.  K.  Hung 

J.  Tenenberg,  L.  Hartman 

Z.  Xiang 


The  Representation  of  Visual  Knowledge 
Parallel  Program  Debugging  with  Partial  Orders 
Efficient  Trouble  Shooting  in  an  Industrial  En¬ 
vironment 

ABLS:  An  Object  Recognition  System  for  Locat¬ 
ing  Address  Blocks  on  Mail  Pieces 
Presuppositions  as  Beliefs:  A  New  Approach 
Hypercube  Algorithms  to  determine  Geometric 
Properties  of  Digitized  Pictures 
Bottom-up  Analysis  for  Color  Separation 
Vivid  Representations  and  Analogues 
The  Spatial  Extension  Terms  in  Modem  Greek 
Semantics  of  a  Recursive  Procedure  with  Param¬ 
eter  and  Aliasing 

Naive  Physics  and  the  Control  of  Inference 
Multi-level  Model-based  Diagnosis  Reasoning 


Seventh  International  Workshop  on  Expert  Systems 

Avignon,  France,  May  13-15, 1987 

presented  S.  N.  Srihari  Tutorial  on  Spatial  Knowledge  Representation 

and  Reasoning 


Sixth  National  Conference  on  AI 
Seattle,  WA,  July  13-17,  1987 
presented  J.  J.  Hull 

G.  Krishna 

other  attendees  S.  N.  Srihari 
D.  Walters 
D.  Niyogi 
K.  Bettinger 
D.  Kumar 
Z.  Dobes 


Hypothesis  Testing  in  a  Computationtd  Theory 
of  Visual  Word  Recognition 
Adding  Vision  Capabilities  to  a  Computer 
Graphics  System  for  Color  Separation 
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AAAI  Workshop  on  Blackboard  Architectures 

Seattle,  WA,  July  15,  1987 

presented  C.-H.  Wang  Object  Reception  m  Structured  and  Random  En¬ 

vironments  Using  a  Blackboard  Architecture 


other  attendees  S.  N.  Srihari 


International  Joint  Conference  on  Artificial  Intelligence 

Milan,  Italy,  August  22-24,  1987 
presented  J.  J.  Hull 

R.  M.  Bozinovic 
J.  Geller 
S  Peters 

other  attendees  S.  N.  Srihari 

S.  C.  Shapiro 

SPSETs  40th  Annual  Conference  and  Symposium  on  Hybrid  Image  Systems 

Rochester,  NY,  May  17-22,  1987 

presented  S.  N.  Srihari  Document  Image  Analysis:  An  Overview  of  Al¬ 

gorithms 

J.  J.  Hull  A  Computationsil  Model  for  Human  and 

Machine  Reading 

International  Workshop  on  Expert  Systems  and  Pattern  Recognition 

Novosibirsk,  USSR,  Oct  25-28,  1987 

presented  S  N.  Srihari  Spatial  Knowledge  Representation  and  Reasoning 


Knowledge  Utilization  in  Handwritten  Zip  Code 
Recognition 

A  Multi-level  Perception  Approach  to  Reading 
Cursive  Script 

Graphical  Deep  Knowledge  for  Intelligent 
Machine  Drafting 

A  Representation  for  Natural  Catagory  Systems 
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2B.9  OTHER  NON-FUNDED  ACTIVITIES  AND  PAPERS 
BY  NAIC  SUPPORTED  RESEARCHERS 


The  Encyclopedia  of  Artificial  Intelligence,  edited  by  Dr.  S.  C.  Shapiro,  was  published  by 
John  Wiley  &  Sons,  Inc^  New  York,  1987. 

Dr.  Srthari  presented  his  paper  (not  sponsored  by  RADC),  “Spatial  Knowledge  Representa¬ 
tion  and  Reasoning”,  at  the  Seventh  International  Workshop  Conference  on  Expert  Systems  and 
Their  Applications,  in  Avignon,  France,  May  13-15,  1987.  He  is  sponsored  by  ECCAl  -  Euro¬ 
pean  Coordinating  Committee  for  Artificial  Intelligence. 

The  Second  Annual  University  at  Buffalo  Graduate-Conference  on  Computer  Science  was 
held  on  March  10,  1987.  This  is  a  one  day  conference  organized  and  presented  completely  by 
SUNWAB  Computer  Science  graduate  students.  Scott  Campbell  was  the  chairman;  Keith  Bet- 
tinger  and  James  GeUer  were  included  on  the  committee.  Twelve  graduates  from  four  depart¬ 
ments  in  three  universities  presented  their  research  work  in  C.S.  -  eight  of  which  were  AI 
related  talks.  In  order  to  continue  and  expand  on  NAICs  principle  of  enhanced  communication 
between  member  institutions,  two  of  the  presenters  were  students  at  the  University  of  Roches¬ 
ter.  We  feel  that  this  event  has  helped  to  strengthen  lines  of  communication  between  our  two 
Universities,  The  event  was  well  attended  with  a  total  146  attendees  -  a  21.6%  increase  over 
last  year’s  attendance.  Many  of  the  attendees  were  from  local  businesses  and  universities,  who 
saw  how  much  our  involvement  in  the  NAIC  has  strengthened  our  department,  as  well  as 
what  is  going  on  in  the  NAIC. 

Dr.  Shapiro  went  to  Bolling  AFB,  Washington,  D.  C.,  April  3-4  (sponsored  by  the 
National  Research  Council,  Board  on  Mathematical  Sciences)  to  attend  a  meeting  of  the  Review 
Panel  for  the  Research  Program  of  the  Mathematical  and  Information  Sciences  Directorate, 
AFOSR. 

Dr.  Shapiro  demonstrated  one  of  the  Explorers  at  the  UB  Freshman  Open  House,  April  11. 

Dr.  Shapiro  presented  “CASSEE:  Development  of  a  Computational  Mind’\  Department  Col¬ 
loquium,  Department  of  Computer  Science,  SUNY  at  Albany,  April  8,  1987. 

Dr.  Shapiro  led  a  discussion  on  AI  with  a  group  of  undergraduate  students  of  Wilkeson 
Quadrangle,  UB,  as  part  of  the  FAST  (Faculty  and  Students  Talking)  series,  April  28. 

Dr.  Shapiro  presented  ‘Toward  a  Computational  Mind”,  at  a  meeting  of  the  Niagara  Fron¬ 
tier  Chapter  of  the  ACM,  Buffalo,  NY,  May  7. 

Dr.  Srihari  (not  sponsored  by  RADC)  traveled  to  Avignon,  France,  May  9-15,  1987,  and 
presented  his  paper,  “Spatial  Knowledge  Representation  and  Reasoning”,  at  the  Seventh  Interna¬ 
tional  Workshop  Conference  on  Expert  Systems  and  Their  Applications.  He  was  sponsored  by 
ECCAl  -  European  Coordinating  Committee  for  Artificial  Intelligence. 

Keith  Bettinger  attended  AAAl-87,  July  13-17,  1987,  in  Seattle,  Washington  as  a 
volunteer. 

Dr.  Sargur  N,  Srihari  (not  funded  by  RADC)  attended  the  AAAI-87  Sixth  National 
Conference  on  Artificial  Intelligence  at  the  University  of  Washington,  Seattle  WA,  July  13-17, 
1987.  He  participated  in  the  Blackboard  Systems  Workshop. 

Dr.  Shapiro  co-authored  the  paper  (not  funded  by  RADC)  with  Sandy  Peters,  “A 
Representation  for  Natural  Category  Systems”,  presented  at  the  International  Joint  Conference 
on  Artificial  Intelligence-87  (UCAI-87)  in  Mil2m,  Italy,  August  23-28,  1987. 
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Sargur  N.  Srihari  and  Radmilo  Bozinovic  presented  a  paper  “A  Multilevel  Perception 
Approach  to  Cursive  Script  Recognition”,  at  the  Tenth  International  Joint  Conference  on 
Artificial  Intelligence-87  (IJCAI-87)  in  Milan,  Italy,  August  23-28,  1987. 

Jonathon  Hull  and  Sargur  N.  Srihari  presented  a  paper,  “Use  of  External  Information  in 
Zip-code  Recognition”,  also  at  the  same  conference,  the  Tenth  International  Joint  Conference  on 
Artificial  Inteiligence-87  (IJCAI-87)  in  MUan,  Italy,  August  23-28,  1987. 

Sargur  N.  Srihari  and  Radmilo  Bozinovic  have  published  a  paper  in  the  Artifidal  Intelli¬ 
gence  Journal,  October  1987  issue.  The  title  of  the  paper  is  the  same  as  the  UCAI  paper,  “A 
Multilevel  Perception  Approach  to  Cursive  Script  Recognition”. 

Dr.  Shapiro  spent  Sept.  16-17  at  the  Department  at  UB  meeting  with  the  VMES  and 
Natural  Language  Planning  research  groups. 

Dr.  Shapiro  attended  the  1987  Natural  Language  Planning  Workshop  in  the  Minnowbrook 
Conference  Center,  Sept.  20-23. 

Dr.  Shapiro  presented  “Semantic  Network  Based  Reasoning  Systems”  at  ISI,  Marina  del 
Rey,  CA,  on  &pt.  30, 1987. 

Dr.  Shapiro  gave  a  talk  XASSIE;  Development  of  a  Computational  Mind”  at  the  Com¬ 
puter  Science  Seminar,  Department  of  Computer  Science,  University  of  Southern  California, 
Oct.  28,  1987. 

The  paper  “A  Model  for  Belief  Revision”  by  Stuart  C.  Shapiro  and  Joao  P.  Martins  has 
been  accepted  for  publication  in  the  journal  Artificial  Intelligence,  and  is  tentatively  scheduled 
to  appear  in  Winter,  1988. 

Dr.  Sargur  N.  Srihari  (not  funded  by  NAIC)  attended  the  International  Workshop  on 
Expert  Systems  and  Pattern  Recognition,  USSR  Academy  of  Science  (Siberian  Section),  Novosi¬ 
birsk,  USSR,  October  26-30, 1987.  He  presented  a  paper  on  Spatial  Knowledge  Representation. 

A  paper  written  by  Srihari,  Wang,  Palumbo,  and  Hull  has  been  published  in  the  Dec.  18, 
1987  issue  of  AI  Magazine.  The  article  is  featured  as  the  lead  article  of  the  issue  on  the  cover. 
AI  Magazine  is  the  principal  publication  of  the  American  Association  of  Artificial  Intelligence. 

Professor  Stuart  C.  Shapiro  has  been  appointed  to  the  program  committee  of  the  First 
International  Conference  on  Principles  of  Knowledge  Represenution  and  Reasoning,  to  be  held 
in  Toronto,  Canada,  May  15-18,  1989. 


2B-3a 


2B.10  SELECTED  DEPARTMENT  ACTIVITIES  IN  1987 


2B.10.1  New  AI  Facility 

David  Sher,  Ph.D^  accepted  an  appointment  as  Assistant  Professor  starting  in  Fall,  1987. 
Dr.  Sher’s  research,  in  Computer  Vision,  was  part  of  Dr.  Chris  Brown’s  task  within  the  NAIC. 

2B.10.2  AI  Faculty 

AI  Faculty 

Six  of  the  fourteen  faculty  at  SUNY  at  Buffalo  are  AI.  They  include:  Shoshana  Hardt; 
William  J.  Rapaport;  David  E  Sher;  Stuart  C.  Shapiro;  Sargur  N.  Srihari;  Deborah  DiC.  Walters. 

2E10^  AI  Seminars  in  1987 

CS702  -  Walters 

Detection  and  Representation  of  Visual  Features  -  Spring  87 
CS703  -  Hardt 

Automating  Intelligent  Interaction  with  Complex  Worlds  -  Spring  1987 

CS705  -  Leyton 

Geometry  of  Robot  Planning 

CS706  -  Srihari 

Introduction  to  Connectionist/Neural  Network  Models 

2E10,4  Advanced  Degrees  Conferred  in  AI 

AI  MJ5.  degrees:  Brinkerhoff  L;  Campbell,  S,  C4  Chan,  C.  M4  Chen,  Y.  Y4  Chun,  S.  A4 
DeVinney,  Ga  Dodson-Simmons,  O4  Feuerstein,  S;  Gupta,  R.;  Jian,  H.;  Kim,  J4  Krishnaswamy, 
V.;  Kuo,  C.  K.;  Lang,  S.  L;  Lee,  H.  C4  Li,  N4  Li,  P4  Lively,  R4  Murty,  K4  Schwartz,  M4  Siracusa, 
T.;  Schneck,  N.  T.;  So,  H.  M.;  Thomas,  Ta  Wang,  G.;  Wroblewski,  S.;  Wu,  T.  Y.;  Wu,  W.  J. 

AI  PhJ),s: 

George  L.  Sicherman 

Thesis  Supervisor  Shoshana  L  Hardt 

"A  Model  of  Probabilistic  Inference  for  Decision-Making  Under  Risk  and  Uncertainty” 


Mingruey  R.  Taie 

Thesis  Supervisor  Sargur  N.  Srihari 

"Representation  of  Device  Knowledge  for  Versatile  Fault  Diagnosis" 


Michael  Almeida 

Thesis  Supervisor  Stuart  C.  Shapiro 

"Reasoning  about  the  Temporal  Structure  of  Narratives” 
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Appendix  Bl: 


"On  Knowledge  Representation  Using  Semantic 
Networks  and  Sanskrit" 


Srihari,  S.N. 
Rapaport,  WJ. 
Kumar,  D. 
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SEMANTIC  NETWORKS  AND  SANSKRIT 


S^.  Srihari,  WJ.  Rapaport,  D.  Kumar 


87-03 


February  1987 


Department  of  Computer  Science 
State  University  of  New  York  at  Buffalo 
226  BeU  Hall 
Buffalo,  New  York  14260 


Thu  work  was  supported  in  part  by  the  National  Science  Foundation  under  grant  no.  IST- 
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Griffiss  Air  Force  Base,  NY  13441  and  the  Air  Force  Office  of  Scientific  Research,  Bolling  AFB, 
DC  20332  under  contract  no.  F30602-8S-C-0008. 
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1.  INTRODUCTION 

Computational  linguistics  is  a  subfield  of  artificial  intelligence  (AI)  concerned  with  the  develop¬ 
ment  of  methodologies  and  algorithms  for  processing  natural -language  by  computer.  Methodologies  for 
computational  linguistics  are  largely  based  on  linguistic  theories,  both  traditional  and  modem. 
Recently,  there  has  been  a  proposal  to  utilize  a  traditional  method,  viz.,  the  shastric  Sanskrit  method  of 
analysis  (Briggs,  1985),  as  a  knowledge  representation  formalism  for  natural-language  processing.  The 
proposal  is  based  on  the  perceived  similarity  between  a  commonly  used  method  of  knowledge 
represenution  in  AI,  viz.,  semantic  networks,  and  the  shastric  Sanskrit  method,  which  is  remarkably 
unambiguous. 

The  influence  of  Sanskrit  on  traditional  Western  linguistics  is  acknowledgedly  significant  (Gelb, 
1985).  While  linguistic  traditions  such  as  Mesopotamian,  Chinese,  Arabic,  etc.,  are  largely  enmeshed 
with  their  particularities,  Sanskrit  has  had  at  least  three  major  influences.  First,  the  unraveling  of 
Indo-European  languages  in  comparative  linguistics  is  attributed  to  the  discovery  of  Sanskrit  by 
Western  linguists.  Second,  Sanskrit  provides  a  phonetic  analysis  method  which  is  vastly  superior  to 
Western  phonetic  tradition  and  its  discovery  led  to  the  systematic  study  of  Western  phonetics.  Third, 
and  most  important  to  the  present  paper,  the  rules  of  analysis  (e.g,  sutras  of  Panini)  for  compound 
nouns,  etc,  is  very  similar  to  contemporary  theories  such  as  those  based  on  semantic  networks. 

The  purpose  of  this  paper  is  threefold:  (i)  to  describe  propositional  semantic  networks  as  used  in 
AI,  as  well  as  a  software  system  for  semantic  network  processing  known  as  SNePS  (Shapiro,  1979),  (ii) 
to  describe  several  case  structures  that  have  been  proposed  for  natural-language  processing  and  which 
are  necessary  for  natural  language  undersunding  based  on  semantic  networks,  and  (iii)  to  introduce  a 
proposal  for  natural-language  translation  based  on  shastric  Sanskrit  and  semantic  networks  as  an 
interlingua. 

2.  KNOWLEDGE  REPRESENTATION  USING  ."SFM  ANTIC  NETWORKS 
2,1.  Semantic  Networks 

A  semantic  network  is  a  method  of  knowledge  representation  that  has  associated  with  it  pro¬ 
cedures  for  representing  information,  for  retrieving  information  from  it,  and  for  performing  inference 
with  it.  There  are  at  least  two  sorts  of  semantic  networks  in  the  AI  literature  (sec  Findler  1979  for  a 
survey):  The  most  common  is  what  is  known  as  an  “inheritance  hierarchy,”  of  which  the  most  well- 
known  is  probably  KL-ONE  (cf.  Brachman  &  Schmolze  1985).  In  an  inheritance  semantic  network, 
nodes  represent  concepts,  and  arcs  represent  relations  between  them.  For  instance,  a  typical  inheritance 
semantic  network  might  represent  the  propositions  that  Socrates  is  human  and  that  humans  are  mortal 
as  in  Figure  1(a).  The  interpreters  for  such  systems  zdlow  properties  to  be  “inherited,"  so  that  the  fact 
that  Socrates  is  mortal  does  not  also  have  to  be  stored  at  the  Socrates-node.  What  is  essential,  however, 
is  that  the  representation  of  a  proposition  (e.g,  that  Socrates  is  human)  consists  only  of  separate 
representations  of  the  individuals  (Socrates  and  the  property  of  being  human)  linked  by  a  relation  arc 
(the  TSA”  arc).  That  is,  propositions  are  not  themselves  objects. 

[Figure  1  here] 

In  a  propositional  semantic  network,  all  information,  including  propositions,  is  represented  by 
nodes.  The  benefit  of  representing  propositions  by  nodes  is  that  propositions  about  propositions  can  be 
represented  with  no  limit.  Thus,  for  example,  the  information  represented  in  the  inheritance  network 
of  Figure  1(a)  could  (though  it  need  not)  be  represented  as  in  Figure  1(b):  the  crucial  difference  is  that 
the  propo«:’:ional  network  conuins  nodes  (m3.  m5)  representing  the  propositions  that  Socrates  is 
human  and  that  humans  are  mortal,  thus  enabling  representations  of  beliefs  and  rules  about  those  pro¬ 
positions. 
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2^  SSePS 

SNePS,  the  Semantic  Network  Processing  System,  is  a  knowledge-represenution  and  reasoning 
software  system  based  on  propositional  semantic  networks.  It  has  been  used  to  model  a  cognitive 
agent’s  understanding  of  natural-language,  in  particular.  English  (Shapiro  1979;  Maida  &  Shapiro  1982; 
Shapiro  &  Rapaport  1986,  1987;  Rapaport  1986).  SNePS  is  implemented  in  the  USP  programming 
language  and  currently  runs  in  Unix-  and  USP-machine  environments. 

Arcs  merely  form  the  underlying  syntactic  structure  of  SNePS.  This  is  embodied  in  the  restric¬ 
tion  that  one  cannot  add  an  arc  between  two  existing  nodes.  That  would  be  tantamount  to  telling 
SNePS  a  proposition  that  is  not  represented  as  a  node.  Another  restriction  is  the  Uniqueness  Principle: 
There  is  a  one-to-one  correspondence  between  nodes  and  represented  concepts.  This  principle  guaran¬ 
tees  that  nodes  will  be  shared  whenever  possible  and  that  nodes  represent  intensional  objects.  (Shapiro 
&  Rapaport  1987.) 

SNePS  nodes  that  only  have  arcs  pointing  to  them  are  considered  to  be  unstructured  or  atomic. 
They  include:  (l)  sensory  nodes,  which — when  SNePS  is  being  used  to  model  a  cognitive  agent — 
represent  interfaces  with  the  external  world  (in  the  examples  that  follow,  they  represent  utterances); 
(2)  base  nodes,  which  represent  individual  concepts  and  properties;  and  (3)  variable  nodes,  which 
represent  arbitrary  individuals  (Fine  1983)  or  arbitrary  propositions. 

Molecular  nodes,  which  have  arcs  emanating  from  them,  include:  (l)  structured  individual  nodes, 
which  represent  structured  individual  concepts  or  properties  (Le^  concepts  and  properties  represented  in 
such  a  way  that  their  internal  structure  is  exhibited) — for  an  example,  see  Section  3,  below;  and  (2) 
structured  proposition  nodes,  which  represent  propositions;  those  with  no  incoming  arcs  represent 
beliefs  of  the  system.  (Note  that  structured  proposition  nodes  can  also  be  considered  to  be  structured 
individuals.)  Proposition  nodes  are  either  atomic  (representing  atomic  propositions)  or  arc  rule  nodes. 
Rule  nodes  represent  deduction  rules  and  are  used  for  node-based  deductive  inference  (Shapiro  1978; 
Shapiro  &  McKay  1980;  McKav  A  Shapiro  1981;  Shapiro,  Martins,  &  McKay  1982).  For  each  of  the 
three  categories  of  molecular  nodes  (structured  individuals,  atomic  propositions,  and  rules),  there  are 
constant  nodes  of  that  category  and  pattern  nodes  of  that  category  representing  arbitrary  entities  of 
that  category. 

There  are  a  few  built-in  arc  labels,  used  mostly  for  rule  nodes.  Paths  of  arcs  can  be  defined, 
allowing  for  path-based  inference,  including  property  inheritance  within  generalization  hierarchies 
(Shapiro  1978,  Srihari  1981).  All  other  arc  labels  are  defined  by  the  user,  typically  at  the  beginning  of 
an  interaction  with  SNePS  In  fact,  since  most  arcs  are  user-defined,  users  are  obligated  to  provide  a 
formal  syntax  and  semantics  for  their  SNePS  networks.  We  provide  some  examples,  below. 

Syntax  and  Semantics  of  SNePS 

In  this  section,  we  give  the  syntax  and  semantics  of  the  nodes  and  arcs  used  in  the  interaction. 
(A  fuller  presentation,  together  with  the  rest  of  the  conveisation,  is  in  Shapiro  &  Rapaport  1986, 
1987.) 

(Def.  1)  A  node  dominates  another  node  if  there  is  a  path  of  directed  arcs  from  the  first  node  to  the 
second  node. 

(Def.  2)  A  pattern  node  is  a  node  that  dominates  a  variable  node. 

(Def.  3)  An  individual  node  is  either  a  base  node,  a  variable  node,  or  a  structured  consunt  or  pattern 
individual  node. 

(Def.  4)  A  proposition  node  is  either  a  structured  proposition  node  or  an  atomic  variable  node 
representing  an  arbitrary  proposition. 

(Syn.l)  If  a;  is  a(n  English)  word  and  t  is  an  identifier  not  previously  used,  then 
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is  a  network,  to  is  a  sensory  node,  and  i  is  a  structured  individual  node. 

(Sein.1)  i  is  the  object  of  thought  corresponding  to  the  utterance  of  to. 

(S7n,2)  If  either  (f  *>^<1  identifiers  not  previously  used,  or  it  is  an  identifier  not  previously  used 
and  <2  is  a  temporal  node,  then 

yr>,  BEFORE  . 


is  a  network  and  1 1  and  <2  ue  temporal  nodes,  Le.  individual  nodes  representing  times. 

(Seni,2)  t  ]  and  I2  ^  objects  of  thought  coneapnnding  to  two  times,  the  former  occurring  before  the 
latter. 

(SynJ)  If  i  and  J  are  individual  nodes  and  m  is  an  identifier  not  previously  used,  then 

PR0PER1V  x-v  OBJECT 


is  a  network  and  m  is  a  structured  proposition  node. 

(Sem,6)  m  is  the  object  of  thought  corresponding  to  the  proposition  that  (the  class  of)  is  are  (a  sub¬ 
class  of  the  class  of)  js. 

(Syn.7)  If  i,  ,  i2  .  >3  are  individual  nodes,  t] ,  <2  are  temporal  nodes,  and  m  is  an  identifier  not  previ¬ 
ously  used,  then 

is  a  network  and  m  is  a  structured  proposition  node. 

(Sem.7)  m  is  the  object  of  thought  corresponding  to  the  proposition  that  agent  i,  performs  act  i2  with 
respect  to  ij  starting  at  time  t]  and  ending  at  time  t2,  where  tj  is  before  t^. 
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(Senu7)  m  is  the  object  of  thought  corresponding  to  the  proposition  that  agent  t,  performs  act  with 
respect  to  ^3  starting  at  time  1 1  and  ending  at  time  1 3,  where  t  ]  is  before  tj. 

3.  NATXniAL-LANGUAGE  UNDERSTANDING  USING  SEMANTIC  NETWORKS 

Semantic  networks  can  be  used  for  natural-language  understanding  as  follows.  The  user  inputs 
an  English  sentence  to  an  augmented-transition-network  (ATN)  grammar  (Woods,  1970,  Shapiro  1982). 
The  parsing  component  of  the  grammar  updates  a  previously  existing  knowledge  base  containing 
semantic  networks  (or  builds  a  new  knowledge  ba^  if  there  was  none  before)  to  represent  the 
system’s  understanding  of  the  input  sentence.  Note  that  this  is  semantic  analysis,  not  syntactic  pars¬ 
ing.  The  newly  built  node  representing  the  proposition  (or  a  previously  existing  node,  if  the  input 
sentence  repeated  information  already  stored  in  the  knowledge  base)  is  then  passed  to  the  generation 
component  of  the  ATN  grammar,  which  generates  an  English  sentence  expressing  the  proposition  in 
the  context  of  the  knowledge  base.  It  should  be  noted  that  there  is  a  single  ATN  parsing-generating 
grammar;  the  generation  of  an  English  output  sentence  from  a  node  is  actually  a  process  of  "parsing” 
the  node  into  English.  If  the  input  sentence  expresses  a  question,  information-retrieval  and  inferencing 
packages  are  used  to  find  or  deduce  an  answer  to  the  question.  The  node  representing  the  answer  is 
then  passed  to  the  generation  granunar  and  expressed  in  English. 

Here  is  a  sample  conversation  with  the  SNePS  system,  together  with  the  networks  that  are  built 
as  a  resulL  User  input  is  on  lines  with  the  :-prompt;  the  system’s  output  is  on  the  lines  that  follow. 
Comments  are  enclosed  in  brackets. 

;  Young  Lucy  petted  a  yellow  dog 
I  understarKd  that  young  Lucy  petted  a  yellow  dog 

[The  system  is  told  something,  which  it  now  "believes.”  Its  entire  belief  structure 
consists  of  nodes  bl,  ml-ml3,  and  the  corresponding  sensory  nodes  (Figure  3).  The 
node  labeled  "now”  represents  the  current  time,  so  the  petting  is  clearly  represented 
as  being  in  the  past  The  system’s  response  is  "I  understand  that”  concateiuted  with 
its  English  description  of  the  proposition  just  entered.] 

:  What  is  yellow 
a  dog  is  yellow 

[This  response  shows  that  the  system  actually  has  some  beliefs;  it  did  not  just  parrot 
back  the  above  sentence.  The  knowledge  base  is  not  updated,  however.] 

:  Dogs  are  ainimals 
I  understand  that  dogs  are  animals 

[The  system  is  told  a  small  section  of  a  class  hierarchy;  this  information  does  update 
the  knowledge  base.] 


[Figure  3  here] 

There  are  three  points  to  note  about  the  use  of  SNePS  for  natural-language  understanding.  First, 
the  system  can  “understand”  an  English  sentence  and  express  its  understanding;  this  is  illustrated  by 
the  first  part  of  the  conversation  above.  Second,  the  system  can  answer  questions  about  what  it  under¬ 
stands;  this  is  illustrated  by  the  second  part.  Third,  the  system  can  incorporate  new  information  into 
its  knowledge  base;  this  is  illustrated  by  the  third  part. 

Case  Frames 

Implicit  in  such  a  language  understanding  system  are  so-called  case  frames.  We  give  a  brief 
summary  here;  for  a  more  thorough  treatment  see  Winograd  (1983). 
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Case-based  deep  structure  analysis  of  English  was  suggested  by  Fillmore  (1968).  The  surface 
structure  of  English  relies  only  on  the  order  of  constituents  and  propositions  in  a  clause  to  indicate 
role.  Examples  are: 

Your  dog  just  bit  my  mother. 

My  mother  just  bit  your  dog. 

In  Russian,  Sanskrit,  etc„  explicit  markings  are  used  to  represent  relationships  between  participants. 
Examples  in  Russian,  which  uses  six  cases  (nominative,  genitive,  dative,  accusative,  instrumental,  and 
prepositional),  are: 

Professor  uchenika  tseloval  (the  professor  kissed  the  student). 

Professora  uchenik  tseloval  (the  student  kissed  the  professor). 

The  extremely  limited  surface  case  system  of  English  led  Fillmore  to  suggest  cases  for  English 
deep  structure  as  follows:  Agentive  (animate  instigator  of  action).  Instrument^  (inanimate  force  or 
object  involved).  Dative  (animate  being  affected  by  action),  F-;ictitive  (object  resulting  from  action). 
Locative  (location  or  orientation),  and  Objective  (everything  else).  For  example,  consider  the  sentence: 

John  opened  the  door  with  the  key. 

Its  case  analysis  yields:  Agentive  -  John,  Objective  -  the  door.  Instrumental  -  the  key. 

Schank  (1975)  developed  a  represenution  for  meaning  (conceptual  dependency)  based  on 
language  independent  conceptual  relationships  between  objects  and  actions:  case  roles  filled  by  objects 
(actor,  object,  attribuant,  recipient),  case  roles  filled  by  conceptualizations  (instrument,  attribute,  ...  ). 
and  case  roles  filled  by  other  conceptual  categories  (time,  location,  sute).  For  example: 

John  handed  Mary  a  book. 

has  the  analysis:  Actor  -  John,  Donor  ■  John,  Recipient  -  Mary,  Object  -  book.  Instrument  -  an  action 
of  physical  motion  with  actor  -  John  and  object  -  band. 

4.  SANSKRIT  CASE  FRAMES  AND  SEMANTIC  NETWORKS 

In  the  previous  section  we  noted  that  natural-language  understanding  based  on  semanitc  net¬ 
works  involves  determining  what  case  frames  will  be  used.  The  current  set  of  case  frames  used  in 
SNePS  is  not  intended  to  be  a  complete  sen  Thus,  we  propose  here  that  sbastric  Sanskrit  case  frames, 
implemented  as  SNePS  networks,  make  an  ideal  knowledge-represenution  “language.” 

There  are  two  distinct  advantages  to  the  use  of  classical  Sanskrit  analysis  techniques.  First,  and 
of  greatest  importance,  it  is  not  an  ad  hoc  method.  As  Briggs  (1985)  has  observed,  Sanskrit  grammari¬ 
ans  have  developed  a  thorough  system  of  semantic  analysis.  Why  should  researchers  in  l^owledge 
representation  and  natural-language  understanding  reinvent  the  wheel?  (cf.  Rapaport  1986).  Thus,  we 
propose  the  use  of  case  frames  based  on  Sanskrit  grammatical  analysis  in  place  of  (or,  in  some  cases,  in 
addition  to)  the  case  frames  used  in  current  SNePS  natural-language  research. 

Second,  and  implicit  in  the  first  advantage,  Sanskrit  grammatical  analyses  are  easily  implement- 
able  in  SNePS  This  should  not  be  surprising.  The  Sanskrit  analyses  are  case-based  analyses,  similar, 
for  example,  to  those  of  Fillmore  (1968).  Propositional  semantic  networks  such  as  SNePS  are  based  on 
such  analyses  and,  thus,  are  highly  suitable  symbolisms  for  implementing  them. 

As  an  example,  consider  the  analysis  of  the  following  English  translation  of  a  Sanskrit  sentence 
(from  Briggs  1985): 

Out  of  friendship,  Maitra  cooks  rice  for  Devadatta  in  a  pot  over  a  fire. 

Briggs  offers  the  following  set  of  “triples,"  that  is,  a  linear  representation  of  a  semantic  network  for 
this  sentence  (Briggs  1985:  37,  38); 

cause,  event,  friendship 
friendship,  object  1,  Devadatta 
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friendship,  object2.  Maitra 
cause,  result  cook 
cook,  agent  Maitra 
cook,  recipient  Maitra 
cook,  instrumient  fire 
cook,  object  rice 
cook,  on-loc,  pot 

But  what  is  the  syntax  and  semantics  of  this  knowledge-representation  scheme?  It  appears  to  be 
rather  ad  hoc.  Of  course,  Briggs  only  introduces  it  in  order  to  compare  it  with  the  Sanskrit  grammati¬ 
cal  analysis,  so  let  us  concentrate  on  that,  instead.  Again  using  triples,  this  is: 

cook,  agent  Maitra 
cook,  object  rice 
cook,  instrument  fire 
cook,  recipient  Oevadatta 
cook,  because-of,  frierKlship 
friendship,  Maitra.  Oevadatta 
cook,  locality,  pot 

Notice  that  all  but  the  penultimate  triple  begins  with  cook.  The  triple  beginning  with  friendship  can 
be  thought  of  as  a  structured  individual:  the  friendship  between  Maitra  and  Oevadatta.  Implemented 
in  SNePS,  this  becomes  the  network  shown  in  Figure  4.  Node  mil  represents  the  structured  individual 
consisting  of  the  relation  of  friendship  holding  between  Maitra  and  Oevadatta.  Node  ml3  represents 
the  proposition  that  an  agent  (named. Maitra.)  performs  an  act  (cooking)  directed  to  an  object  (rice), 
using  an  instrument  (fire),  for  a  recipient  (named  Oevadatu).  at  a  locality  (a  pot),  out  of  a  cause  (the 
friendship  between  the  agent  and  the  recipient). 

Such  an  analysis  caiL  presumably,  be  algorithmically  derived  from  a  Sanskrit  sentence  and  can 
be  algorithmically  transformed  back  into  a  Sanskrit. sentence.  Since  an  English  sentence,  for  insunce, 
can  also  presumably  be  analyzed  in  this  wav  (at  the  very  least,  sentences  of  Indo-European  languages 
should  be  easily  analyzable  in  this  fashion),  we  have  the  basis  for  an  interlingual  machine-translation 
system  grounded  in  a  well-established  semantic  theory. 

[Figure  4  here] 


5.  INTERLINGUAL  MACHINE  TRANSLATION 

The  possibility  of  translating  natural-language  texts  using  an  intermediate  common  language 
was  suggested  by  Warren  Weaver  (1949).  Translation  using  a  common  language  (an  “interlingtia’’)  is  a 
two-suge  process:  from  source  language  to  an  interlingua,  and  from  the  interlingua  to  the  target 
language  (Figure  S).  This  approach  is  characteristic  of  a  system  in  which  representation  of  the  “mean¬ 
ing"  of  the  source-language  input  is  intended  to  be  independent  of  any  language,  and  in  which  this 
same  representation  is  used  to  synthesize  the  target-language  outpuL  In  an  adtem^tive  approach  (the 
“transfer"  approach),  the  results  of  source  text  analysis  are  converted  into  a  corresponding  representa¬ 
tion  for  target  text,  which  is  then  used  for  outpuL  Figure  6  shows  how  the  interlingua  (indirect) 
approach  compares  to  other  (direct  and  transfer)  approaches  to  machine  translation  (MT).  The  inter¬ 
lingua  approach  to  translation  was  heavily  influenced  by  formal  linguistic  theories  (Hutchins  1982). 
This  calls  for  an  interlingua  to  be  formal,  language-independent,  and  “adequate"  for  knowledge 
representation. 


[Figures  5  and  6  here] 
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Various  proposals  for  interlinguas  have  included  the  use  of  formalized  natural-language,  artificial 
“international'*  languages  like  Esperanto,  and  various  symbolic  representations.  Most  prior  work  on 
interlinguas  has  centered  on  the  representation  of  the  lexical  content  of  text.  Bennet  et  (1986)  point 
out  that  a  large  portion  of  synuctic  structures,  even  when  reduced  to  “canonical  form,"  remain  too 
language-specific  to  act  as  an  interlingua  representation.  Thus,  major  disadvantages  of  an  interlingua- 
bas^  system  result  from  the  practical  difficulty  of  actually  defining  a  language-free  interlingua 
representation. 

Besides,  none  of  the  existing  MT  systems  use  a  significant  amount  of  semantic  information  (Slo¬ 
cum  1985).  Thus,  the  success  of  an  interlingua  depends  on  the  nature  of  the  interlingua  as  well  as  the 
analysis  rendered  on  the  source  text  to  obtain  its  interlingual  representation.  This  made  the  inter¬ 
lingua  approach  too  ambitious,  and  researchers  have  inclined  more  towards  a  transfer  approach. 

It  has  been  argued  that  analyses  of  natural-language  sentences  in  semantic  networks  and  in  San¬ 
skrit  grammar  is  remarkably  similar  (Briggs  1985).  Thus  we  propose  an  implementation  of  Sanskrit 
in  a  semantic  network  to  be  used  as  an  interlingua  for  MT.  As  an  interlingua,  Sanskrit  fulfills  the 
basic  requirements  of  being  formal,  language-independent,  and  a  powerful  medium  for  representing 
meaning. 
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Figure  1(a).  An  inhaitance-liierarcfay  semantic  n^work 


Figure  1(b).  A  SNePS  propositional  semantic  network  (m3  and  mS  rqpresent  the 
propositions  that  Socrates  is  human  and  that  humans  are  mortal,  respectively) 
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BEFORE 


Figure  3.  SNePS  network  for  ‘‘Young  Lucy  petted  a  yellow  dog. 
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Figure  5.  The  Ihterlingua  Approaches 
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Figure  6.  Various  Approaches  to  MT 
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Knowledge-Based  Parsing 

J.G.Neal  and  S.C  Shapiro 


Abstract  An  extremely  significant  feature  of  any  natural  language  (NL)  is  that  it 
is  its  own  metalanguage.  One  can  use  an  NL  to  talk  about  the  NL  itself.  One  can 
use  an  NL  to  tutor  a  non-native  speaker,  or  other  poor  language  user,  in  the  use  of 
the  same  NL  We  have  been  exploring  methods  of  knowledge  representation  and 
NL  understanding  (NLU)  whi<^  would  allow  an  artificial  intelligence  (AI)  system 
to  play  the  role  of  poor  language  user  in  this  setting.  The  AI  system  would  have  to 
understand  NL  utterances  about  how  the  NL  is  used,  and  improve  its  NLU  abili¬ 
ties  according  to  this  instruction.  It  would  be  an  NLU  system  for  which  the 
domain  being'discussed  in  NL  is  the  NL  itself. 

Our  NLU  system  is  implemented  in  the  form  of  a  general  rule-based  inference 
system  which  reasons  according  to  the  rules  of  its  knowledge  base.  These  rules 
comprise  the  system's  knowledge  of  language  understanding  in  the  same  way  that 
the  rules  of  any  rule-based  system  comprise  that  system's  knowledge  of  its  domain 
of  application.  Our  system.uses  the-same  knowledge  base  for  both  linguistic  and 
other  knowledge  since  we  feel  that  there  is  no  clear  boundary  line  separating  syn¬ 
tactic,  semantic,  and  world  knowledge. 

We  are  exploring  the  possibility  of  an  NLU  system's  becoming  more  fadle  in  its 
use  of  some  language  by  being  told  how  that  language  is  used.  We  wish  this  expla¬ 
nation  to  be  given  in  an  increasingly  sophisticated  subset  of  the  language  being 
taught  Clearly,  ^e  system  must  start  with  some  language  facility,  but  we  are  inter¬ 
ested  in  seeing  how  small  and  theory-independent  we  can  make  the  initial,  "ker¬ 
nel”  language.  This  article  reports  the  current  state  of  our  work. 


1  Introduction 
1.1  Overview 

An  extremely  significant  feature  of  any  natural  language  (NL)  is  that  it  is  its  own 
metalanguage.  One  can  use  an  NL  to  talk  about  the  NL  itself.  One  can  use  an  NL 
to  tutor  a  non-native  speaker,  or  other  poor  language  user,  in  the  use  of  the  same 
NL  We  have  been  exploring  methods  of  knowledge  representation  (KR)  and  NL 
understanding  (NLU)  which  would  allow  an  artificial  intelligence  (AI)  system  to 
play  the  role  of  poor  language  user  in  this  setting.  The  AI  system  would  have  to 
understand  NL  utterances  about  how  the  NL  is  used,  and  improve  its  NLU  abili- 
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des  according  to  this  instruction.  It  would  be  an  NLU  system  for  which  the 
domain  being  discussed  in  NL  is  the  NL  itself. 

It  is  essential  to  our  approach  tc  have  the  system’s  ^/arsing  and  linguisdc  knowl¬ 
edge  be  an  integral  part  of  its  domain  knowledge.  Acknowledging  that  what  is 
meant  by  “meaning”  is  controversial  (Quine,  1948),  we  take  the  meaning  or  signifi¬ 
cance  of  a  word  or  phrase  to  include  linguistic  knowledge  about  the  word  or 
phrase.  For  example,  we  frel  that  how  a  word  like  “dog”  is  used  in  language  is  a 
part  of  its  “meaning”,  along  with  other  properties  such  as  the  fact  that  “dog” 
denotes  a  special  kind  of  animal  with  typical  characteristics.  The  implementation 
of  our  system  is  based  upon  the  above  stated  view  and  therefore  the  rules  and 
assertions  comprising  the  system’s  knowledge  of  language  understanding,  includ¬ 
ing  syntax,  is  integrated  into  the  system’s  knowledge  base  along  with  its  other  task 
domain  knowledge. 

We  are  exploring  the  possibility  of  an  NLU  system’s  becoming  more  facile  in  its 
use  of  some  language  by  being  taught  how  that  language  is  used.  The  teacher 
might  be  a  conversation  partner  who  happens  to  use  some  phrase  the  system  is  not 
yet  familiar  with,  or  a  language  theorist  who  wants  to  find  out  if  she  can  explain 
her  theory  completely  and  clearly  enough  for  the -system  to  use  language  accord¬ 
ing  to  it  We  wish  this  explanation  to  be  given  in  an  increasingly  sophisticated  sub¬ 
set  of  the  language  being  taught.  That  is,  why  not  test  and  make  use  of  the  sys¬ 
tem’s  language  capability  by  using  it  to  continue  the  system’s  “education”? 
Clearly,  the  system  must  start  with  some  language  facility,  but  we  are  interested  in 
seeing  how  small  and  theory-independent  we  can  make  the  initial,  “kernel”  lan¬ 
guage. 

In  this  chapter,  we  will  discuss  our  knowledge  representation  techniques,  the 
system's  kernel  language  (KL),  and  parsing  strategy.  We  will  demonstrate  how  our 
system  can  be  instructed  in  the  use  of  some  language  defined  by  the  teacher  and 
how  the  system’s  acquired  language  can  itself  be  used  as  its  own  metalanguage. 
The  kernel  language  only  incorporates  primitive  relations  such  as  one  token  being 
a  predecessor  of  another  in  a  string,  membership  in  a  lexical  or  string  category, 
and  constituency.  As  an  example  of  using  the  system’s  language  as  its  own  meta¬ 
language  to  enhance  its  language  capability,  we  will  demonstrate,  starting  with 
only  the  KL,  how  the  system  can  be  instructed  with  regard  to  the  number  (i.e.,  sin¬ 
gular  or  plural)  of  some  words  and  then  be  informed  that  “If  theTiead-noun  of  a 
noun-phrase  X  has  number  Y,  then  X  has  number  Y”.  This  newly  acquired 
knowledge  can  then  be  applied  by  the  system  to  infer  that  since  “glasses”  is  plural, 
so  is  “the  old  man’s  glasses"  when  it  reads  this  phrase  in  a  sentence  such  as  “The 
old  man’s  glasses  were  filled  with  water”. 

Our  system  is  able  to  understand  when  strings  arc  mentioned  in  input  utterances 
as  well  as  when  they  are  used  to  communicate  with  the  system.  This  capability  is 
demonstrated  frequently  in  this  chapter,  but  particularly  in  Sect.  4  with  the  classic 
sentence  from  Tarski  (1944);  “‘Snow  is  white’  is  true  if  and  only  if  snow  is  white". 

The  use  of  inference  and  world  knowledge  is  essential  for  a  system  to  parse  sen¬ 
tences  such  as  “John  saw  the  bird  without  binoculars"  and  “John  saw  the  bird 
without  tailfeathers”  from  Schubert  and  Pelletier  (1982)  or  “John  saw  the  man  on 
the  hill  with  a  telescope”.  Our  research  is  based  upon  the  concept  of  having  pars¬ 
ing  performed  by  a  general  reasoning  system  which  has  the  capability  of  applying 
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worid  knowledge  inferences  during  parsing,  since  the  “parser**  is  not  a  separate 
isolated  component  with  special  sublanguage,  representations,  or  knowledge 
base. 


1.2  Fundamental  Assumptions 

Our  system  incorporates  the  use-mention  distinction  (Quine,  1951)  for  language. 
Our  representations  reflect  the  fact  that  the  meaning  of  a  token  or  surface  string  is 
distinct  from  the  token  or  string  itself.  Our  system's  knowledge  base  maintains  a 
representation  for  a  token  or  surface  string  that  is  distinct  from  the  representation 
of  the  interpretation  of  the  input  token  or  string.  This  distinction  is  the  same  as 
between  a  numeral  and  a  number  in  mathematics.  To  refer  to  a  word  or  string 
rather  than  its  meaning,  the  user  must  use  the  usual  English  convention  of  prefac¬ 
ing  the  word  by  a  single-quote  mark  or  enclosing  the  string  in  quotation  marks. 
(See  Sections  2.2.1  and  2.4.2  for  more  information.) 

A  second  principle  upon  which  our  work  is  based  is  that  each  occurrence  of  a 
givcii  surface  string  in  the  input  stream  is  assumed  to  have  a  different  interpreta¬ 
tion,  unless  the  teacher  has  entered  rules  into  the  system  to  dictate  otherwise.  For 
example,  if  a  name  such  as  **John**  has  been  entered  into  the  lexicon  and  is  used 
twice,  either  in  successive  utterances  or  within  the  same  utterance,  then  the  system 
interprets  each  occurrence  of  the  name  as  referring  to  a  different  entity  unless  the 
teacher  has  instructed  the  system  otherwise.  Since  an  NLU  system  must  ’  ^  capa¬ 
ble  of  handling  ambiguities,  and,  in  a  situation  in  which  no  explicit  rules  are 
known  to  the  system  to  giude  it  in  determining  whether  a  word  or  phrase  is  ambig¬ 
uous,  it  must  have  a  default  procedure  to  follow,  we  have  chosen  to  implement  the 
above  principle.  Although  our  approach  would  seem  to  overiy  complicate  the  net¬ 
work,  it  is  a  reasonable  default  prindple  since  there  is  some  evidence  that  merging 
of  nodes  is  easier  than  splitting  nodes  (Maida  and  Shapiro,  1982). 

A  third  principle  which  is  fundamental  to  our  theory  is  that  all  possible  parses 
and  interpretations  of  a  surface  string  are  to  be  determined  according  to  the  lan¬ 
guage  definition  used  by  the  system.  We  feel  that  multiple  interpretations,  when 
justified  by  the  language  definition,  are  warrented  since  agile  human  minds  fre¬ 
quently  perceive  alternative  interpretations  and,  in  fact,  a  great  deal  of  humor  is 
dependent  upon  this. 

Our  system  does  not  currently  do  morphological  analysis.  One  of  the  ^eas  in 
which  we  plan  to  do  future  research  is  knowledge-based  morphological  analysis. 
We  plan  to  develop  a  system  component  that  would  perform  morphological  anal¬ 
ysis  and  function  as  a  preprocessor  or  coprocessor  with  the  system  discu‘:sed  in 
this  article. 


1 .3  Declarative  Knowledge  Representation  in  an  Integrated  Knowledge  Base 

Our  approach  is  to  represent  knowledge  in  declarative  form,  to  the  greatest  extent 
possible,  in  the  semantic  network  formalism.  This  applies  to  all  knowledge  includ¬ 
ing  linguistic  knowledge  and  the  rules  which  are  applied  by  the  inference  machine 
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to  guide  the  system's  reasoning,  the  parsing  process  being  one  manifestation  of  the 
system’s  reasoning  according  to  the  rules  of  its  network  knowledge  base.  It  is  our 
intent  that  the  system's  knowledge,  including  its  linguistic  knowledge,  be  available 
to  the  teacher  in  the  same  way  that  domain  knowledge  is  in  other  AI  systems. 

Furthermore,  the  declarative  form  is  a  more  suitable  form  for  linguistic  knowl¬ 
edge  in  theoretical  studies  of  language.  A  language  definition  or  description  is 
inherently  declarative,  and  as  Pereira  and  Warren  have  pointed  put:  “The  theorists 
have  concentrated  on  describing  what  natural  language  is,  in  a  dear  and  elegant 
way.  In  this  context,  details  of  how  natural  language  is  actually  recognized  or  gen¬ 
erated  need  not  be  relevant,  and  indeed  should  probably  not  be  allowed  to 
obscure  the  language  definition"  (1980,  p.269,  italics  in  the  original).  In  this 
regard,  a  dedarative  representation  is  preferable  to  a  formalism  such  as  an  ATN, 
in  that  the  ATN  is  a  description  of  a  process  for  recognizing  a  language. 

Our  system  uses  an  integrated  knowledge  base  for  both  linguistic  and  other 
knowledge  as  advocated  by  Pollack  and  Waltz  (1982)  and  by  Dahl  (1981).  As  indi¬ 
cated  in  Section  1.1,  we  take  the  meaning  of  a  word  or  phrase  to  indude  linguistic 
knowledge  about  the  word  or  phrase  and  its  use.  Furthermore,  we  feel  that  there  is 
no  dear  boundary  line  separating  syntactic,  semantic,  and  world  knowledge.  For 
example,  it  is  not  clear  to  what  extent  the  classification  of  words  into  lexical  cate¬ 
gories  depends  on  meaning,  function,  or  form.  Should  certain  words  be  dassifled 
as  mass  nouns  because  they  fit  certain  distributional  frames  or  have  a  certain  form 
(e.g.,  I  used  (sand,  the  sand,  a  bag  of  sand,  *a  sand,  *two  sands.})  or  are  the  frames 
and  forms  simply  a  reflection  of  the  property  we  think  of  as  diaractermng  the 
substances  named  by  mass  nouns,  namely  that  the  substance  is  not  naturally  phys¬ 
ically  bounded  and  that  when  two  amounts  of  the  substance  are  “put  together" 
they  become  one  amount?  Perhaps  certain  aspects  of  syntax  carmot  or  should  not 
be  separated  from  semantics.  Furthermore,  the  terms  “semantic  knowledge”  and 
“wo4d  knowledge"  seem  only  to  be  used  to  informally  express  a  measure  of  the 
sophistication  or  complexity  of  knowledge. 


1.4  System  Overview 

Our  Natural  Language  System  is  being  developed  and  implemented  using  the 
SNePS  semantic  network  processing  system  (Shapiro,  1979a;  Shapiro  and  the 
SNePS  Group,  1981).  The  terminology  and  representations  for  some  of  the  basic 
categories,  objects,  and  relations  of  this  work  evolved  from  a  preliminary  study 
reported  in  Shapiro  and  Neal  (1982).  Figure  1  illustrates  an  overview  of  the  sys¬ 
tem. 

The  semantic  network  formalism  has  been  used  by  many  researchers  for  knowl¬ 
edge  representation  (Quillian,  1968,  1969;  Rumelhart  and  Norman,  1973;  Sim¬ 
mons,  1973;  Woods,  1975;  P. Hayes  1977;  Schubert,  1976;  Hendrix,  1978,  1979; 
Schubert  et  aL,  1979;  Brachman,  1979).  In  contrast  to  other  semantic  network 
implementations,  the  SNePS  system  provides  a  uniform  declarative  representation 
for  both  rules  and  assertions  in  the  network  (Shapiro,  1971, 1979b).  Furthermore, 
our  system  comprises  an  effort  to  utilize  a  common  representation  for  problem¬ 
solving  and  language-comprehension  information  as  advocated  by  Chamiak 
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rig.1.  Overview  of  the  NL  system 


- >  I  ROLE  BASED  PARSER] 

i  T 

NETWORK  DATA  BASE  OF  ROLES  6  ASSERTIONS, 
PARSING  KNOWLEDGE,  &  WORLD  KNOWLEDGE 


CORE  I  USER 


(1981).  Our  system  is  similar  to  the  Prolog-based  (Roussel,  1975)  systems  of  War¬ 
ren  and  Pereira  (1982),  Dahl  (1979, 1981),  and  McCord  (1982)  in  that  it  is  imple¬ 
mented  in  a  logic-based  system  in  which  processing  is  a  form  of  inference.  The 
SNePS  inference  package  (Shapiro  et  al.,  1982),  however,  is  not  based  on  the  reso-' 
iution  principle  (Robinson,  1965)  as  is  Prolog,  but  on  a  muld-processing  approach 
(Kaplan,  1973;  McKay  and  Shapiro,  1980)  incorporating  a  producer-consumer 
model.  SNePS  also  provides  a  facility  for  ‘‘procedural  attachment**  in  rules  to  han¬ 
dle  processing  knowledge  for  whi^  the  declarative  netwoiic  representation  is 
urmatural. 

The  PSI-KLONE  system  (Bobrow  and  Webber;  1980)  uses  linguistic  knowledge 
represented  in  a  KLrONE  netwbric  (Brachman,  1978a,  1978b,  1979)  to  function  as 
semantic  interpreter  for  parsed  surface  strings.  The  PSI-KLONE  interpreter,  how¬ 
ever;  functions  in  cooperation  with  an  ATN  parser  in  the  RUS  framework 
(Bobrow,  1978).  In  contrast,  we  are  implementing  an  integrated  system  for  syntac¬ 
tic  and  semantic  processing  which  uses  a  uniform  representation  for  syntactic  and 
semantic  knowledge. 

The  rule-based  parser  of  Figure  1  is  essentially  the  SNePS  inference  package 
which  reasons  according  to  the  rules  of  the  knowledge  base. 

The  knowledge  base  consists  of  CORE  knowledge  and  USER  knowledge.  The 
CORE  knowledge  is  provided  by  the  designers  of  the  system  and  defines  a  kernel 
language  initially  acceptable  to  the  system.  USER  knowledge  results  from  the  pro¬ 
cessing  of  user  input  utterances. 

The  function  of  our  NL  parser  is  twofold: 

1.  derivation  of  zero  or  more  armotated  parse  trees  for  the  input  surface  string; 

2.  construction  of  a  networic  representation  for  the  interpretation  of  the  input 

utterance  from  the  annotated  parse  tree  and  from  other  relevant  knowledge 

from  the  network  data  base. 

The  above  two  functions  are  not  handled  by  separate  processors,  but,  instead,  are 
both  accomplished  by  the  SNePS  inference  package  as  a  result  of  the  application 
of  CORE  and  USER  rules.  The  processes  of  accomplishing  the  two  functions  are 
interrelated  and  can  cooperate.  The  interpretation  of  a  surface  string  will  depend 
on  how  it  is  syntactically  parsed  and,  conversely,  the  syntactic  parse  of  a  surface 
string  can  depend  on  the  meanings  of  related,  constitutent,  or  neighboring  strings. 
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The  two  processes  are  not  carried  out  in  a  purely  seouential  fashion  for  a  given 
input  utterance,  since  interpretations  can  be  constructed  for  parsed  constituent 
strings  before  the  parsing  of  the  entire  utterance  is  complete. 


1.5  Knowledge  Representation  Techniques 

A  SNePS  semantic  network  is  a  directed  graph  with  labeled  arcs  in  which  nodes 
represent  concepts  and  the  arcs  repr^ent  nonconceptual  binary  relations  between 
concepts.  It  is  generally  agreed  that  the  nodes  of  a  semantic  network  represent 
intensional  concepts  (Woods,  197S:  Brachman,  1977;  Maida  and  Shapiro,  1982). 
A  “concept”  is  something  in  our  domain  of  interest  about  which  we  may  want  to 
store  information  and  which  may  be  the  subject  of  “thought”  and  inference.  Since 
each  concept  is  represented  by  a  node,  the  relations  represented  by  the  arcs  of  our 
system  are  not  conceptual,  but  structural  (Shapiro,  1979  a). 

The  primary  type  of  arc  in  a  SNePS  network  is  the  descending  arc  and  if  there  is 
a  path  of  descending  arcs  from  node  N  to  node  M,  N  is  said  to  dominate  M.  Two 
important  types  of  nodes  are  molecular  and  atomic  nodes.  Molecular  nodes  are 
nodes  that  dominate  other  nodes.  Atomic  nodes  are  simply  not  molecular.  Atomic 
nodes  can  be  constant  (representing  a  unique  semantic  concept)  or  variable.  Vari¬ 
able  nodes  are  used  in  SNePS  as  variables  are  used  in  normal  predicate  logic 
notations.  Netwoiic  nodes  can  also  be  categorized  as  in  the  table  in  Figure  2. 

A  propositional  molecular  node  N  together  with  the  arcs  inddent  from  the  node 
and  the  nodes  .  .,Mk  immediately  dominated  by  N  correspond  to  a  case  frame 
(Fillmore,  1968;  Bruce,  1975)  where  the  arc  names  correspond  to  the  slot  names, 
and  the  nodes  M],. .  .,Mk  represent  the  slot  fillers.  Undominated  molecular  nodes 
in  a  SNePS  network  represent  propositions  believed  by  the  system.  Concepts  such 
as  the  following  are  propositional  and  are  represented  by  molecular  nodes:  Lex¬ 
eme  L  is  a  member  of  category  C;  SI  is  a  constituent  string  of  S2;  lexeme  L  has 
number  N  (i.e.  singular  or  plural).  Simple  examples  of  propositional  nodes  are 
Ml  and  M2  of  Figure  3. 

Node  Ml  represents  the  proposition  that  B1  represents  the  concept  expressed 
by  the  word  “NOUN”  and  M2  represents  the  proposition  that  the  lexeme 
“SNOW”  is  in  the  category  called  “NOUN”. 

The  syntactic  objects  represented  in  our  network  knowledge  base  include  mor¬ 
phemes,  surface  strings,  and  nodes  of  annotated  parse  trees.  Individual  mor¬ 
phemes  are  represented  as  nodes  whose  identifiers  or  print  names  are  the  mor¬ 
phemes  themselves.  The  representation  of  a  surface  string  utilized  in  this  study 
consists  of  a  network  version  of  the  list  structure  used  by  Pereira  and  Warren 
(1980).  This  representation  is  also  similar  to  Kay’s  charts  (1973)  in  that  whenever 
alternative  analyses  arc  made  for  a  given  substring  of  a  sentence,  the  sentence 
structure  is  enhanced  by  multiple  structures  representing  these  alternative  analy¬ 
ses.  Retention  of  the  alternatives  avoids  the  reanalyses  of  previously  processed 
substrings  which  occurs  in  a  backtracking  system.  Our  basic  representation  of  a 
surface  string  is  illustrated  in  Figure  4. 

Nodes  identified  by  the  atoms  BO,  SNOW,  IS,  and  WHITE  are  atomic  nodes 
and  represent  objects:  the  empty  string,  and  tokens  “SNOW”,  “IS”,  and 
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Category 

Type  at  concept 

Non-dominated  (asserted) 

Asserted  proposition  which  is 

molecular  node 

"believed"  by  the  System 

Dominated  molecular  node 

Proposition  or  structured  object  which 
is  a  participant  in  a  proposition 

Atomic  node 

Object 

Fig.  2.  Table  of  node  categories 


Hg.3.  Ml  and  M2  are  simple  examples  of  propositional  nodes 


Fig.  4.  Basic  network  representation  of  a  surface  string 


“WHITE”,  respectively.  Node  M4  is  molecular  and  represents  the  initial  string 
“SNOW”.  M5  is  also  molecular  and  represents  the  initial  string  “SNOW  IS”,  and 
similarly  for  node  M6.  A  node  such  as  M6  that  represents  an  object  would  typi¬ 
cally  be  dominated  in  our  system  by  some  node  representing  a  proposition 
about  it. 

As  each  word  of  an  input  string  is  read  by  the  system,  the  network  representa¬ 
tion  of  the  string  is  extended  and  relevant  rules  stored  in  the  SNePS  network  are 
triggered. 

Interpretations  of  surface  strings  are  also  represented  as  nodes  of  our  network 
knowledge  base.  The  kernel  language  of  the  system  enables  the  user  to  define  case 
frame  structures  and  to  define  rules  to  guide  the  system  in  interpreting  input  utter¬ 
ances. 
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1.6  Core  Knowledge  and  the  Kernel  Language 

Our  approach  is  to  provide  the  teacher  (user)  with  a  kernel  language  in  which  she 
can  begin  to  ‘‘explain’*  the  syntax  and  semantics  of  some  natural  or  invented  lan¬ 
guage  to  the  system.  The  present  version  of  our  kernel  language  includes: 

a)  predefined  terms  such  as  L-CAT,  the  set  of  the  names  of  lexical  classes,  and 
S-CAT,  the  set  of  the  names  of  string  classes;  S-CAT  contains  the  important 
category  names  ANT-CLAUSE,  CQ-CLAUSE,  and  RULE-STMT,  which  are 
used  to  bootstrap  into  a  more  sophisticated  rule  input  capability; 

b)  predefined  objects  such  as  (i)  initial  strings  and  (ii)  bounded  strings  with  begin¬ 
ning  and  ending  token; 

c)  predefined  relations  such  as  (i)  lexeme  L  is  a  member  of  category  C;  (iO 
bounded  string  B  is  a  member  of  category  C  and  this  structure  is  represented 
by  S;  (iii)  structure  S  expresses  concept  C;  (iv)  structure  SI  is  a  constituent  of 
structure  S2; 

d)  predefined  functions  such  as  0)  a  test  to  determine  whether  two  network  nodes 
are  identical,  (ii)  a  test  to  determine  whether  two  bounded  surface  strings 
match,  and  (iii)  a  test  to  determine  whether  one  bounded  string  precedes 
another  bounded  string. 

The  KL  provides  the  teacher  with  a  basic  language  of  rewrite  rules  for  the  purpose 
of  defining  synmcr/c  lexical  insertions,  context  free  phrase  structure  rules,  and  con¬ 
text  sensitive  rules  as  well  as  semantic  mappings  from  string  categories  to  case 
frame  structures  and  mappings  from  string  categories  to  case  frame  participant  or 
component  slots. 


1.7  Metalanguage  Conventions  and  Symbols 

In  this  chapter,  we  use  the  notational  convention  that  words  written  in  upper  case 

letters  denote  words  of  KL  and  we  use  the  metasymbols: 

0  denote  a  non-terminal;  if  the  angle  brackets  enclose  the  name  of  a  category  of 
the  core  or  a  user-defined  category,  then  such  usage  denotes  a  variable  whose 
domain  is  the  category  named  within  the  enclosing  brackets. 

0  for  grouping. 

*  Kleene  star;  when  used  as  a  superscript  on  an  item,  denotes  zero  or  more  of 
the  items  in  a  finite  sequence. 

+  when  used  as  a  superscript  on  an  item,  denotes  one  or  more  of  the  items  in  a 
finite  sequence. 

. . .  ellipsis. 
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2  Core  Knowledge  and  Representations 

2.1  Uniform  Representation  and  Intensional  Constructs 

We  use  the  semantic  network  formalism  to  represent  both  syntactic  and  semantic 
knowledge  in  the  form  of  assertions  and  rules  to  be  applied  in  inference  forma¬ 
tion.  We  include  linguistic  knowledge  in  the  network  knowledge  base  and  use  the 
network  formalism  as  a  uniform  "language”  with  which  to  represent  all  types  of 
knowledge.  Thus  we  model  surface  strings  and  syntactic  properties  and  categories 
as  intensions  (Woods,  1975;  Brachman,  1977;  Maida  and  Shapiro,  1982),  con¬ 
cepts,  or  objects  of  thought 


2.2  Predefined  Categories,  Objects,  Relations,  Functions 
2.2.1  Predefined  Categories 

We  are  investigating  the  capability  of  an  NLU  system  becoming  more  adept  in  the 
use  of  some  language  by  being  instructed  in  the  use  of  the  language.  The  system 
must  start  with  some  language  fadlity,  but  we  are  striving  to  make  the  core  knowl¬ 
edge  base  as  small  and  theory-independent  as  possible. 

Included  among  the  core  primitives  are  certain  predefined  categories.  Since  we 
are  designing  a  language  capability  that  is  as  theory-independent  as  possible  and 
not  a  robust  parser  for  a  predetermined  language  such  as  English,  some  of  these 
categories  are  initially  empty,  while  others  have  very  few  members.  All  the  catego¬ 
ries  are  to  be  utilized  by  the  teacher,  cither  directly  or  indirectly,  and  the  member¬ 
ship  of  the  categories  expanded  by  the  teacher  as  the  definition  of  her  target  lan¬ 
guage  takes  shape. 

The  most  basic  of  these  categories  are  L-CAT,  S-CAT,  and  VARIABLE.  L-CAT 
consists  of  the  names  of  lexical  classes  or  classes  of  terms.  L-CAT  initially  con¬ 
tains  the  predefined  terms  L-CAT,  S-CAT,  VARIABLE,  PUNCTUATION,  and 
FUNCTION-NAMES.  Names  that  the  teacher  would  add  to  L-CAT  might 
include,  for  example,  NOUN,  VERB,  and  PREPOSITION. 

The  purpose  of  VARIABLE  is  to  contain  all  the  identifiers  that  the  teacher  will 
use  as  variables  in  her  processing  rules  when  stated  as  input  to  the  system.  The 
VARIABLE  category  is  initially  empty. 

The  category  PUNCTUATION  initially  contains  the  punctuation  marks  period, 
single-quote,  and  double-quote^ 

The  class  FUNCTION-NAMES  contains  the  names  of  the  funaions  that  the 
teacher  has  available  to  be  used  in  a  form  of  procedural  attachment  to  the  declar¬ 
ative  rules  of  the  network  knowledge  base.  FUNCTION-NAMES  initially  con¬ 
tains  the  names  of  the  tests  discussed  in  Section  2.2.4:  IDENTITY-TEST, 
STRING-MATCH-TEST,  and  PRECEDES-TEST. 

S-CAT  is  defined  to  be  the  set  of  all  the  names  of  string  categories.  S-CAT  ini¬ 
tially  contains  the  names  of  the  predefined  string  categories  UTTERANCE,  P- 
RULE,  CASE-FRAME-DEFINITION,  CASE-SLOT-DEFINITION,  LITERAL, 
LITERAL-STRING,  UNIQUE-MEANING-CAT,  VAR-APPOSITION-PHR, 
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MAIN-APPOS-PHR.  VAR-NAME,  ANT-CLAUSE,  CQ-CLAUSE,  and  RULE- 
STMT.  The  string  categories  P-RULE,  CASE-FRAME-DEFINITION,  CASE- 
SLOT-DEHNmON,  LITERAL,  LITERALOTUNG,  and  VAR-NAME  each 
have  predefined  syntax.  For  the  remaining  string  categories  except  UTTER¬ 
ANCE  the  definition  of  the  syntax  is  left  to  the  teadier,  VAR-APPOSITION- 
PHR  and  RULE-STMT  having  restrictions  discussed  later  in  this  article.  The  class 
UTTERANCE  contains  all  input  surface  strings. 

The  predefined  string  category  P-RULE  includes  all  strings  that  qualify  as  pro¬ 
duction  or  syntactic  rewrite  rules  as  discussed  in  Section  2J.2.  These  rewrite  rules 
are  pait  of  the  kernel  language  understood  by  the  system. 

The  kernel  language  includes  semantic  rewrite  rules  to  enable  the  teacher  to 
define  case  frames  and  associations  between  case  frames  and  particular  string 
categories  for  use  in  the  interpretation  of  input  utterances.  CASE-FRAME-DEFI- 
NmON  and  CASE-SLOT-DEFINITION  are  string  categories  that  contain  the 
two  types  of  semantic  rewrite  rules.  The  capability  associated  with  the  CASE- 
FRAME-DERNITION  and  CASE-SLOT-DEFINITION  classes  is  discussed  in 
Section  233. 

LITERAL  is  the  category  of  strings  consisting  of  a  single-quote  mark  followed 
by  a  lexeme.  LITERAL-STRING  is  the  category  of  strings  that  consist  of  a  pair  of 
double-quote  marks  enclosing  a  surface  string. 

VAR-APPOSmON-PHR,  MAIN-APPOS-PHR,  and  VAR-NAME  are  string 
categories  that  enable  a  variable  to  be  used  as  an  appositive  so  as  to  establish  the 
variable  as  the  identifier  for  the  MAIN-APPOS-PHRase  which  it  is  adjacent  to. 
For  example,  after  input  of  appropriate  user-defined  rules  to  the  systenr,  the  string 
“a  noun-phrase  X”  (from  the  sentence  “If  the  head-noun  of  a  noun-phrase  X  has 
number  Y  then  X  has  number  Y")  could  be  parsed  as  a  VAR-APPOSITION-PHR 
with  “a  noun-phrase**  as  the  MAIN-APPOS-PHR  and  “X”  the  VAR-NAME  so 
that  in  parsing  the  stated  rule,  X  is  remembered  by  the  system  as  an  identifier  for 
the  unknown  noun-phrase  refered  to  in  the  phrase  and  is  thus  capable  of  being 
referred  to  again  later  as  in  the  given  rule  example.  Since  no  referendng  mecha¬ 
nisms  are  built  into  our  system  to  enable  the  teacher  to  refer  to  previously  men¬ 
tioned  concepts,  the  above  string  categories  assist  the  teacher  in  establishing  rules 
to  determine  the  referendng  process  according  to  her  own  theory.  The  use  of  this 
capability  is  illustrated  by  example  in  Section  3. 

As  the  teacher  proceeds  to  instruct  the  system  in  her  language  definition,  she 
will  need  to  enter  rules  that  cannot  be  expressed  in  the  language  of  rewrite  rules. 
Such  rules  would  include  rules  concerning  the  semantics  of  utterances.  Therefore, 
the  core  primitives  include  three  initially  empty  string  categories,  RULE-STMT, 
ANT-CLAUSE,  and  CQ-CLAUSE  to  enable  the  teacher  to  define  the  syntax  of 
general  conditional  rules.  These  categories  are  discussed  in  subsequent  sections. 

UNIQUE-MEANING-CAT  is  defined  to  be  the  class  of  all  the  strings  that 
have  a  unique  meaning.  That  is,  if  a  string  is  in  UNIQUE-MEANING-CAT,  it 
must  express  the  same  intension  each  time  it  is  encountered  in  an  input  utterance. 
As  stated  in  Section  1.2,  a  premise  of  our  theory  and  NL  system  is  that  each  time  a 
given  word  or  string  is  “read**  by  the  system,  it  has  a  new  or  different  meaning 
unless  this  meaning  is  determined  by  rules  and/or  assertions  input  by  the 
teacher. 
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U.2  Predefined  Objects 


The  predeftned  objects  essential  to  our  theoiy  and  implementation  are  the  con¬ 
cepts  of  the  Initial  String  and  the  Bounded  String.  These  objects  and  their  network 
representations  are  described  below. 


a)  Initial  string  S  consists  of  the  word  or  symbol  W  con¬ 
catenated  to  the  initial  string  Q.  Q  may  be  the  null  string 
represented  by  node  BO. 


b)  Bounded  string  B  represents  the  surface  string  begin¬ 
ning  with  the  last  word  of  initial  string  SI  and  ending 
with  the  last  word  of  initial  string  S2  where  SI  precedes 
S2. 


1.13  Predefined  Relations 


It  is  necessaiy  for  the  NL  system  to  have  a  set  of  predefined  relations  for  knowl¬ 
edge  representation.  The  current  set  of  these  relations  and  their  corresponding 
semantic  network  structures  are  listed  below. 


a)  Lexeme  L  is  a  member  of  category  C;  e.  g.,  node  M21 
of  RgureS  represents  the  concept  that  ‘STUDENT  is  a 
NOUN. 


b)  The  bounded  string  B  is  in  category  C  and  this  struc¬ 
ture  or  parse  of  the  string  B  is  represented  by  node  S 
(analogous  to  a  node  of  an  annotated  parse  tree);  e.g., 
node  M43  of  Figure  5  represents  the  concept  that  the 
structure  represented  by  B21  represents  a  parsing  of 
the  bounded  string  represented  by  M42  as  an  INDEF- 
NOUN-PHRASE. 


c)  Structure  or  parsed  string  S  expresses  concept  C;  e.g., 
node  M20  of  RgureS  represents  the  concept  that  the 
string  “NOUN”  expresses  the  category  of  nouns  repre¬ 
sented  by  node  BIO. 
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d)  The  structure  SI  is  a  constituent  of  structure  S2;  e.g^ 
node  M44  of  figure  5  represents  the  concept  that  the 
literal  ‘STUDENT  is  a  constituent  of  the  structure  rep¬ 
resented  by  node  B21. 


c)  The  rule  structures  of  SNePS  (Shapiro,  1979  a). 


Figure  5  shows  a  surface  string  enhanced  by  additional  structure  that  would  result 
from  the  system's  reading  and  parsing  the  input  string  “A  STUDENT"  after  some 
syntactic  rules  had  been  input  by  the  teacher  (e.g.,  'A  is  an  INDEF-DET,  'STU¬ 
DENT  is  a  NOUN,  a  string  consisting  of  an  INOEF-DET  followed  by  a  NOUN 
is  an  NOUN-PHRASE). 


2.2.4  Predefined  Functions 

The  following  functions  are  essential  for  the  NL  system  and  could  not  be  effi- 
cently  implemented  in  the  declarative  SNePS  language. 

a)  Identity  test  takes  two  network  nodes  as  arguments,  and  returns  true  if  the  two 
nodes  are  identical  and  returns  false  otherweise. 

b)  String^match  test  takes  two  bounded  strings  in  network  representation  as  argu¬ 
ments,  and  returns  true  if  the  sequence  of  words  or  symbols  in  the  two  strings 
are  identical,  and  returns  false  otherwise. 

c)  Precedes  test  takes  two  bounded  strings  in  network  representation  as  argu¬ 
ments,  and  returns  true  if  the  first  string  precedes  the  second  string  in  the  input 
stream,  and  returns  false  otherweise. 

2.3  The  Reading  Function 

The  system’s  reading  function  “reads”  one  token  (lexeme  or  punctuation  mark)  at 
a  time  from  the  input  stream.  For  each  input  token,  the  structure  of  Figure  6  is 
added  to  the  network,  where  node  S  represents  the  previously  added  initial  string, 
C  represents  the  lexical  category  of  the  token,  I  represents  the  newly  established 
initial  string,  and  B  represents  the  newly  added  bounded  string. 

.If  the  token  belongs  to  any  lexical  categories,  this  membership  would  already  be 
represented  in  the  network  in  the  form  of  relation  (a)  of  Section  2.2.3  (how  such 
relations  are  established  in  explained  in  Section  2.4).  The  lexical  categories  to 
which  the  input  token  belongs  are  found  in  the  network  by  the  reading  function 
and,  for  each  such  category  C,  a  node  such  as  M  of  Figure  6  is  added.  If  no  such 
categories  exist,  then  only  the  initial  string  and  the  bounded  string  are  added. 

Forward  inference  may  be  triggered  by  the  addition  of  the  network  of  Figure  6 
for  each  token,  depending  on  what  rules  are  already  in  the  system.  For  example,  in 
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Figure  5,  nodes  M38  und  M41  are  added  by  the  reading  function  and  nodes  M42, 
M43,  M44,  and  M45  are  built  only  if  there  is  a  rule  in  the  system  that  asserts  that 
an  INDEF-DET  followed  by  a  NOUN  is  an  INDEF-NOUN-PHRASE. 

2.4  The  Representational  Mapping 
2.4.1  Introduction 

Not  all  strings  of  a  language  form  meaningful  '^chunks”.  For  example,  the  sub¬ 
string  “a  large"  from  the  sentence  “A  large  aggressive  dog  frightened  the  girl"  is 
not  a  conceptually  coherent  constituent  of  the  sentence.  Many  researchers,  e.g., 
Fodor  and  Garrett  (1967),  Sever  (1970, 1973),  and  Levelt  (1970,  1974),  have  inves¬ 
tigated  the  relationship  between  surface  constituents  and  the  conceptually  coher¬ 
ent  components  of  an  utterance.  There  seems  to  be  good  evidence  for  surface  con¬ 
stituents  being  the  coherent  units  for  comprehension  of  discourse.  How  sentential 
constituents  or  discourse  constituents  (moving  up  to  a  higher  level  in  the  organiza¬ 
tion  of  text)  are  utilized  in  the  comprehension  process  is  an  active  field  of  research 
(Brown  and  Yule,  1983). 

We  let  R  designate  the  representational  mapping  (Allen,  1978)  from  surface 
strings  to  their  interpretations.  The  domain  of  R  contains  the  categories  of  strings 
that  form  conceptually  coherent  units,  possibly  depending  on  linguistic  or  other 
contexts.  The  domain  of  R  initially  contains  predefined  categories  L-CAT,  S-CAT, 
VARIABLE,  LITERAL,  LITERAL-STRING,  VAR-APPOSITION-PHR,  RULE- 
STMT,  P-RULE,  CASE-FRAME-DEHNITION,  and  CASE-SLOT-DEFINI¬ 
TION.  They  are  discussed  in  the  following  sections. 

We  provide  the  teacher  of  the  system  with  the  facility  for  determining  what  the 
conceptually  coherent  constituents  will  be,  in  addition  to  the  core,  and  for 
instructing  the  system  in  their  use. 


2.4.2  Base  Cases 

The  categories  L-CAT,  S-CAT,  VARIABLE,  LITERAL,  and  LITERAL-STRING 
are  the  base  cases  for  the  representational  mapping.  The  most  basic  subclass  of  the 
domain  of  R  is  L-CAT,  the  class  of  identifiers  for  the  lexical  categories  of  the  sys¬ 
tem,  including  both  system  identifiers  and  user-defined  identifiers.  The  class  L- 
CAT  contains  the  predefined  identifiers  L-CAT,  S-CAT,  and  VARIABLE.  The 
representational  mapping  applied  to  any  identifier  in  L-CAT  maps  to  a  constant 
base  node.  In  Figure  6,  the  interpretations  of  the  identifiers  L-CAT  and  S-CAT  are 
represented  by  nodes  B1  and  B2  respectively.  Similarly,  if  the  system  is  informed 
that  ’NOUN  is  an  L-CAT,  then  its  interpretation  is  represented  by  a  base  node  (B4 
of  Fig. 7).  The  information  that  ’GOOSE  is  a  NOUN  and  that  ’NOUN-PHRASE 
is  in  S-CAT  is  also  represented  in  Figure  7. 

A  member  of  VARIABLE  maps  to  a  corresponding  network  variable  node. 
Since  the  interpretation  of  a  user  variable  must  be  local  to  the  rule  in  which  it  is 
used,  the  representational  mapping  applied  to  the  class  VARIABLE  is  handled  in 
a  special  manner  as  explained  in  Section  3. 
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Fig.  7.  Representation  of  some  basic  lexical  knowledge 


<LZTERAL> 


Fig.  8.  The  representational  mapping  applied 
to  a  LITERAL 


'  <word> 


.<wocd> 


<LITERAL-STRING>  =======«*===»> 


The  representational  mapping  applied  to  a  LITERAL  (defined  in  Section  2.2.1 
as  the  single-quote  mark  followed  by  a  word)  maps  to  the  node  whose  identifier  is 
the  word  itself,  as  illustrated  in  Figure  8. 

The  representational  mapping  applied  to  a  LITERAL-STRING  (a  string 
enclosed  by  double-quote  marks)  maps  to  the  bounded  string  representing  the 
string  enclosed  by  the  quote  marks.  Figure  9  illustrates  the  representational  map¬ 
ping  applied  to  the  literal  string  ‘‘SNOW  IS  WHITE”. 
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<ANT-CLAaSE>  <CQ-CLAOSE>  R(<ANT-CLAOSE>]  R(<CQ-CLA0SE>1 

Fig.  10.  Representational  mapping  applied  to  a  RULE-STMT 


2.43  Propositions  and  Structured  Objects 

Some  string  categories  contained  in  the  domain  of  R  are  mapped  to  non-atomic 
network  (case  frame)  structures  representing  propositions  or  struaured  objects  by 
the  representational  mapping.  The  system  has  just  two  predefined  string  catego¬ 
ries,  namely  P-RULE  and  RULE-STMT,  whose  members’  interpretations  are  rep¬ 
resented  as  non-atomic  structures  by  the  representational  mapping.  P-RULEs 
have  a  predetermined  syntax  and  are  translated  into  SNePS  network  rules  using 
the  predefined  struaures.  RULE-STMT  is  defined  as  the  class  of  strings  that  are 
interpreted  by  the  system  as  general  rules.  This  class  is  initially  empty  and  the  syn¬ 
tax  is  to  be  deterinined  by  the  teacher.  A  RULE-STMT  must  have  an  ANT- 
CLAUSE  and  a  CC^LAUSE  as  constituents.  The  structure  resulting  from  the 
application  of  the  representadonal  mapping  R  to  a  RULE-STMT  is  illustrated  in 
Rgure  10. 

The  ANT-CLAUSE  category  is  defined  as  the  class  of  strings  that  can  be  used 
in  antecedent  position  in  rules  input  by  the  teacher.  Similarly,  the  CQ-CLAUSE 
category  is  defined  as  the  class  of  strings  that  can  be  used  in  consequent  position 
in  rules  input  by  the  teacher.  Both  of  these  classes  are  initially  empty  and  the  syn¬ 
tax  of  RULE-STMTs,  ANT-CLAUSEs,  and  CQ-CLAUSEs  is  to  be  determined  by 
the  teacher.  An  example  is  discussed  in  Section  3. 

The  teacher  can  add  new  string  categories,  whose  interpretations  are  to  be  rep¬ 
resented  by  non-atomic  network  structures,  to  the  domain  of  R  and  specify  the 
semantics  of  these  string  categories  by  using  the  semantic  rewrite  rule  capability 
discussed  in  Sect  2.5.3. 


2.4.4  Partidpants  in  Propositions  or  Relations;  Components  of  Structured  Objects 

In  the  previous  seaion,  the  category  of  natural  language  phrases  that  assert  rela¬ 
tions  between  concepts  or  objects  was  discussed  briefly.  This  type  of  phrase  maps 
to  the  top  node  of  a  molecular  representational  structure. 

Many  phrases  of  natural  language  refer  to  individual  concepts  or  objects  that 
are  participants  in  relations  or  propositions  or  that  are  components  of  structured 
objects.  This  type  of  phrase  would  map  to  a  slot  of  one  or  more  network  case 
frame  structures.  The  system  has  no  predefined  string  categories  which  map  to 
“partidpant”  slots.  The  syntax  for  creating  new  categories  of  this  type  and  their 
associated  semantics  is  discussed  in  2.5.3. 


2.5  Kernel  Language 
2^.1  Predefined  Terms 

As  previously  indicated,  we  are  attempting  to  provide  a  facility  with  which  a  per¬ 
son  can  define  a  target  language  and  yet  keep  the  core  as  small  and  unbiased  as 
possible.  It  is  essential  to  provide  the  person  (teacher)  with  a  kernel  language  with 
which  to  start  building  up  her  language  definition.  The  kernel  language  (KL)  of 
our  system  consists  of  predefined  terms,  syntactic  rewrite  rules,  and  semantic  rew¬ 
rite  rules.  The  predefined  terms  of  the  system  are  the  names  of  the  categores  dis¬ 
cussed  in  Section  2.2.1. 


2.5.2  Syntactic  Rewrite  Rules 

The  kernel  language  includes  linguistic  rewrite  rules  to  enable  the  teacher  to 
I.iStraa  die  system  in  the  basic  syntax  of  her  target  language. 


a)  Lexical  Entry:  The  KL  includes  syntactic  production  rules  of  the  form  (L-CAT) 
—  (LITERAL)  where  (L-CAT)  represents  the  name  of  a  lexical  category  that  has 
already  beeh  defined.  A  LITERAL  was  defined  in  Section  2.2,1  as  consistiiig  of 
the  single-quote  followed  by  a  word  (the  single-quote  is  part  of  the  KL  and  indi¬ 
cates  that  the  following  word  is  mentioned  rather  than  used).  This  form  of  produc¬ 
tion  rule  is  the  means  of  entering  lexical  items  such  as 


L-CAT— ’NOUN 
L-CAT^  ’PROPER-NOUN 
L-CAT-* ’DEF-DET 
L-CAT-*  ’INDEF-DET 
L-CAT  -*  ’VERB 
L-CAT-^  ’BE-VERB 
L-CAT  ^  ’ADVERB 
L-CAT  —  ’ADJECTIVE 
NOUN  ^  ’GOOSE 
NOUN  ’GEESE 
PROPER-NOUN  —  ’GRADY 
PROPER-NOUN  ^  ’GLADYS 
DEF-DET  ^  ’THE 
INDEF-DET  —  ’A 
VERB  —  ’HAS 
BE-VERB  ’IS 

ADVERB  ^  THEN 


L-CAT  ’PREPOSITION 
L-CAT  ^’CONJ 
L-CAT  —  ’PROPERTY 
S-CAT  —  ’HEAD-NOUN 
S-CAT  ^  ’STRING 
VARIABLE  -*  ’X 
VARIABLE  —  ’Y 

ADJECTIVE  —  ’WHITE 
ADJECTIVE  ^  ’SINGULAR 
ADJECTIVE  —  ’PLURAL 
PREPOSITION  —  ’OF 
CONJ  ^  ’IF 
PROPERTY  —  ’COLOR 
PROPERTY  -*  ’NUMBER 
UNIQUE-MEANING-CAT  - 
’ADJECTIVE 

UNIQUE-MEANING-CAT  - 
’PROPERTY 
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b)  Context  Free  Rules:  The  KL  includes  rules  of  the  form 
(S-CAT)  -*•  (s)i. .  .(s}k,  k> 0, 

where  (S-CAT)  represents  the  name  of  a  string  category  and  for  each  i,  (s)i  is  either 
a  LITERAL,  the  name  of  a  lexical  category  previously  entered  as  a  member  of 
L-CAT  as  in  (a)  above,  or  the  name  of  a  string  category. 

Examples:  PROPERTY-CLAUSE  -*  SUBJECT  PREDICATE 
SUBJECT  —  NOUN-PHRASE 
NOUN-PHRASE  —  LITERAL 
NOUN-PHRASE  —  VARIABLE 
NOUN-PHRASE  —  PROPER-NOUN 
PREDICATE  —  RELATION-PREDICATE 
PREDICATE  —  BE-PREDICATE 

RELATION-PREDICATE  —  RELATION  PREDICATE-ADJ 
BE-PREDICATE  —  BE-VERB  PROPERTY-INDICATOR 
RELATION  —  ’HAS  PROPERTY-INDICATOR 
PROPERTY-INDICATOR  —  PROPERTY-CLASS-INDICATOR 
PROPERTY-INDICATOR  —  PROPERTY 
PROPERTY-CLASS-INDICATOR  —  PREDICATE-ADJ 
RULE-STMT  —  TF  ANT-CLAUSE  THEN  CQ-CLAUSE 

c)  Context  Seiisitive  Rules:  The  KL  includes  syntactic  production  rules  of  the  form 

(Is),. .  .0s)„  —  (rs),. .  .(rs)„.  n  >  0, 

where  each  element  (Is),  or  (rs),  is  either  a  LITERAL,  the  name  of  a  lexical  cate¬ 
gory,  or  the  name  of  a  string  category;  both  sides  of  the  rule  must  have  the  same 
number  of  elements  and  for  each  element  (Is),  of  the  left  side, 

1)  if  (ls)i  is  a  LITERAL  or  lexical  category,  then  the  corresponding  element  (rs),  of 
the  right  side  must  be  the  same  as  ^s),; 

2)  if  (ls)i  is  the  name  of  a  string  category,  then  the  corresponding  element  (rs),  of 
the  right  side  can  be  either  a  LITERAL,  lexical  category  name,  or  string  cate¬ 
gory  name. 

This  facility  allows  the  user  to  enter  context  sensitive  rules,  such  as: 

RELATION  PREDICATE-ADJ  —  RELATION  ADJECTIVE 
RELATION  PREDICATE-ADJ  —  RELATION  VARIABLE 
BE-VERB  PREDICATE-ADJ  —  BE-VERB  ADJECTIVE 
TF  ANT-CLAUSE  —  TF  PROPERTY-CLAUSE 
THEN  CQ-CLAUSE  —  THEN  PROPERTY-CLAUSE 

The  first  rule  asserts  that  in  the  context  of  a  RELATION,  an  ADJECTIVE  is  rec¬ 
ognized  as  a  PREDICATE-ADJ,  the  second  asserts  that  in  the  context  of  a 
RELATION,  a  VARIABLE  is  parsed  as  a  PREDICATE-ADJ,  and  the  third 
asserts  that  in  the  context  of  a  BE-VERB,  an  ADJECTIVE  is  parsed  as  a  PREDI¬ 
CATE-ADJ.  Similarly,  the  fourth  and  fifth  rules  state  that  following  the  word  “IF” 
or  “THEN",  a  PROPERTY-CLAUSE  is  parsed  as  an  ANT-CLAUSE  or  CQ- 
CLAUSE,  respectively. 
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SUBJECT  : :  NOUN-PHRASE 

NOUN-PHRASE  ::  LITERAL 

NOUN-PHRASE  ::  VARIABLE 

NOUN-PHRASE  ::  PROPER-NOUN 

PROPERTY-INDICATOR  ::  PROPERTY-CLASS-INDICATOR 

PROPERTY-INDICATOR  ::  PROPERTY 

PREDICATE-ADJ  ::  ADJECTIVE 

PREDICATE-ADJ  ::  VARIABLE 

define  the  semantics  of  a  PROPERTY-CLAUSE  to  be  a  case  frame  with  three 
slots,  such  that  the  PROPERTYOF  slot  is  filled  by  the  interpretation  of  the  SUB¬ 
JECT  of  the  PROPERTY-CLAUSE,  the  PROPERTY  slot  is  filled  by  the  interpre¬ 
tation  of  the  PROPERTY-INDICATOR  consUtuent  of  the  PROPERTY- 
CLAUSE,  and  the  VALUE  slot  is  filled  by  the  interpretation  of  the  PREDICATE- 
ADJ  constituent  The  second  and  third  definitions  above  indicate  that  the 
interpretation  of  an  ANT-CLAUSE  is  the  same  as  the  interpretation  of  its  PROP¬ 
ERTY-CLAUSE  constituent  and  that  the  interpretation  of  a  CQ-CLAUSE  is  the 
same  as  the  interpretation  of  its  PROPERTY-CLAUSE  constituent,  respectively. 
The  next  rule  defines  the  interpretation  of  a  SUBJECT  to  be  the  same  as  the  inter¬ 
pretation  of  its  NOUN-PHRASE  constituent.  The  next  three  rules  define  the 
interpretation  of  a  NOUN-PHRASE  to  be  the  interpretation  of  either  its 
LITERAL  constituent,  its  VARIABLE  constituent,  or  its  PROPER-NOUN  con¬ 
stituent,  whichever  it  has.  The  remaining  rules  are  similarly  understood  by  the  sys¬ 
tem. 

The  representational  mapping  R  builds  a  network  structure  such  as  that  domi¬ 
nated  by  node  M  of  Rgure  11  as  the  interpretation  of  a  PROPERTY-CLAUSE. 


Fig. It.  Representational  mapping  applied  to  a  PROPERTY-CLAUSE 


b)  Case  Frame  Slot-Filler  Definitions;  In  order  to  provide  a  capability  for  defining 
the  semantics  of  a  phrase  whose  interpretation  is  a  slot-filler  in  a  case  frame,  the 
follo^t'ing  type  of  semantic  rewrite  rule  is  included.  The  syntax  of  the  CASE- 
SLOT-DEFINITION  is 

(phr-name)  >  >  ([(slot-name)i  (string-name)i .  .  . 

(slot-name)„  (string-name)n 

where  the  square  brackets  are  part  of  the  object  language  and  the  4-  and  the  pa¬ 
rentheses  are  metasymbols.  The  (phr-name)  is  the  name  selected  by  the  teacher  for 
the  category  of  strings  whose  semant^  are  defined  by  the  expression  to  the  right 
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a)  Case  Frame  Definitions:  The  KL  includes  language  to  enable  the  teacher  to 
define  case  frames  and  instruct  the  system  in  their  use  by  using  the  syntax  of  a 
CASE-FRAME-DERNITION: 

(string-cat)  : :  (sIot-name)i  (constit-name)i .  .  . 

(slot-name)n  (constit-name)„ 

where  n>0.  Such  a  CASE-FRAME-DEFINITION  is  used  by  the  system  as  fol¬ 
lows;  A  string  that  is  identified  as  being  in  category  (string-cat)  is  mapped  into  a 
case  frame  such  that  for  each  slot  identified  by  (slot-name);,  the  slot-filler  is  the 
interpretation  of  the  constituent  string  identified  by  (constit-name);.  The  constitu¬ 
ent  strings  need  not  be  immediate  constituents  of  the  string  in  category  (string-cat). 
The  same  (constit-name)  can  be  used  to  specify  the  ftller  for  more  than  one  slot. 
For  example,  suppose  the  teacher  wants  to  define  a  language  in  which  an  utter¬ 
ance  such  as  “JOHN  BOUGHT  A  HOUSE”,  involving  the  act  of  purchase,  L 
interpreted  to  mean  that  the  person  bought  the  object  for  himself  unless  otherwise 
stated.  To  handle  the  semantics  of  such  a  clause,  the  teacher  might  want  to  define 
a  case  frame  with  AGENT  and  BENEFICIARY  slots  which  are  both  filled  by  the 
interpretation  of  the  same  constituent  of  the  clause.  Our  CASE-FRAME-DEFI¬ 
NITION  facility  provides  for  this  eventuality. 

If  a  string  is  parsed  as  a  (string-cat)  but  is  missing  a  constituent  specified  by  the 
semantic  rewrite  rule,  then  a  default  representation  of  the  slot-filler  corresponding 
to  the  missing  constituent  is  established  in  the  form  of  an  atomic  node  about 
which  the  system  knows  nothing,  other  than  its  being  a  participant  in  the  case 
frame.  In  the  context  of  a  RULE-STMT  the  atomic  node  will  be  a  variable  node 
(see  Seaion  1 J),  otherwise  a  constant  node.  For  example,  to  represent  the  inter¬ 
pretation  of  a  sentence  such  as  “THE  HOUSE  WAS  PURCHASED  YESTER¬ 
DAY”  the  teacher  might  want  to  use  the  same  case  frame  mentioned  in  the  pre¬ 
ceding  paragraph.  Since  an  AGENT  and  BENEFICIARY  are  implicitly  part  of 
the  act  of  purchase,  but  not  explicitly  mentioned  in  the  sentence,  it  is  reasonable 
for  the  unmentioned  participants  to  be  represented  in  the  interpretation  of  the  sen¬ 
tence.  The  above  default  representation  for  the  interpretation  of  a  missing  constit¬ 
uent  provides  the  teacher  with  a  facility  for  instructing  the  system  how  to  interpret 
such  a  sentence. 

An  alternative  syntax  for  the  CASE-FRAME-DEFINITION  is 
(string-cat): : (constit-name). 

The  right  side  of  the  ; ;  symbol  is  a  degenerate  case  frame  and  the  definition  is 
interpreted  as  meaning  that  the  semantics  of  the  string  of  category  (string-cat)  is 
the  same  as  that  of  the  constituent  string  of  category  (constit-name).  For  example, 

PROPERTY-CLAUSE::  PROPERTY  OF  SUBJECT 

PROPERTY  PROPERTY-INDICATOR 

VALUE  PREDICATE-ADJ 

ANT-CLAUSE  : :  PROPERTY-CLAUSE 

CQ-CLAUSE  : :  PROPERTY-CLAUSE 
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of  the  symbol  >  > .  The  object  language  symbol  +  must  be  used  in  place  of  at 
least  one  (string-name),  designating  the  position  of  the  interpretation  of  the  string 
(phr-name)  in  the  case  frame. 

Each  set  of  brackets  encloses  a  case  frame  definition  as  described  in  the  previ¬ 
ous  section.  That  is,  each  slot  named  (sIot-name)i  is  filled  by  the  interpretation  of  a 
string  in  category  (string-name)i,  if  a  string  of  category  (string-name)i  is  present  as  a 
(not  necessarily  immediate)  constituent  of  the  string  of  category  (phr-name).  The 
-I-  symbol  marks  the  slot  whose  filler  is  the  interpretation  of  the  (phr-name)  string. 
The  system  represents  the  interpretation  of  the  ^hr-name)  string  (1)  as  a  variable 
atomic  node  if  the  semantic  rule  is  used  in  the  context  of  a  RULE-STMT  and  (2) 
as  a  constant  atomic  node,  otherwise.  If  a  slot-filler  constituent  is  specified  in  a 
semantic  rewrite  rule,  but  is  missing  from  the  surface  string  to  which  the  rule  is 
being  applied,  a  default  representation  of  the  slot-filler  corresponding  to  the  miss¬ 
ing  constituent  is  established  in  the  form  of  an  atomic  node  about  which  the  sys¬ 
tem  knows  nothing,  other  than  its  being  a  participant  in  the  case  frame  (as  in  the 
previous  section  for  a  CASE-FRAME-DEFINITION). 

Consider  the  following  example  CASE-SLOT-DEFINITION: 

PROPERTY-CLASS-INDICATOR  >  >  (MEMBER  ADJECTIVE  PROP¬ 
ERTY-CLASS  -I-] 

According  to  this  rule,  a  PROPERTY-CLASS-INDICATOR  should  have  an 
ADJECTIVE  constituent  and  the  interpretation  of  a  string  of  the  PROPERTY- 
CLASS-INDICATOR  category  would  be  represented  by  an  atomic  node  which 
fills  the  PROPERTY-CLASS  slot  of  a  case  frame  whoso  MEMBER  slot  is  filled 
by  the  interpretation  of  the  ADJECTIVE  constituent  The  mapping  from  a  surface 
string  to  the  network  representation  of  its  interpretation  is  illustrated  in  Figure  12. 

In  order  to  prepare  for  the  example  discussed  in  Sect  3,  the  following  rule  is 
input: 

INDEF-S-PHRASE  >  >  [BSTR  STRING  CAT  S-CAT  STRC-I-] 

According  to  this  rule,  the  interpretation  of  a  string  parsed  by  the  System  as  an 
INDEF-S-PHRASE  would  be  represented  by  an  atomic  node  filling  the  STRC 
slot  of  a  case  frame  whose  CAT  slot  is  filled  by  the  interpretation  of  the  S-CAT 
constituent  of  the  INDEF-S-PHRASE  and  whose  BSTR  slot  is  filled  by  the 
STRING  constituent.  This  is  illustrated  in  Figure  13. 


R 

<PROPERTY-CLASS-INDICATOR>  — 


<ADJECTIVE> 

Fig.  12.  Representational  mapping  applied  to  a  PROPERTY-CLASS-INDICATOR 
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<STRING>  <S-CAT> 

Fig.  13.  Representational  mapping  applied  to  an  INOEF>S-PHRASE 


MOUN> PHRASE 


PROPER-NOUN 

GRAOy 


RELATION 


PREDICATE-AM 


Adjective 

I 

WHITE 


Fig.  14.  Parse  tree  for  sample  input  utterance 


2.6  Use  in  Language  Processing 

To  illustrate  the  system’s  use  of  the  language  definition  developed  via  the  rewrite 
rules  of  the  preceding  sections,  we  show  some  sentences  of  this  language  which 
refer  to  the  language  itself  compared  with  some  that  refer  to  a  non-linguistic 
domain.  Thinking  affectionately  of  her  pet  geese,  the  teacher  informs  the  system 
that  “GRADY  HAS  COLOR  WHITE".  The  system  recognizes  and  builds  the 
parse  tree  of  Figure  14  for  the  utterance.  We  show  the  more  conventional  form  of 
the  parse  tree,  rather  than  the  equivalent  network  parse  tree  that  the  system  actu¬ 
ally  builds  in  order  to  simplify  the  figure. 

In  the  preceding  sections,  the  teacher  has  entered  rewrite  rules  into  the  system 
to  define  the  semantics  for  certain  string  classes  (e.g.,  PROPERTY-CLAUSE, 
PROPERTY-INDICATOR),  thereby  identifying  the  conceptually  coherent  constit¬ 
uents  for,the  language  definition.  The  system  applies  these  semantic  rewrite  rules 
and  builds  the  structure  of  Figure  15  as  the  interpretation  of  the  utterance.  The 
assertion  that  the  parsed  input  utterance  expresses  the  concept  represented  by 
node  M75  is  also  established  in  the  network. 

Similarly,  the  system  can  process  the  input  utterance  “‘GOOSE  HAS  NUM¬ 
BER  SINGULAR”.  The  resulting  parse  tree  is  shown  in  Figure  16. 

The  representation  of  the  interpretation  of  the  utterance  is  shown  in  Figure  17. 
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PROPERTY-CLAUSE 


SUBJECT 


NOUN-PHRASE 


LITERAL 
•  GOOSE 

Fig.  16.  Parse  tree  for  utterance  concerning  language 


PREDICATE 

RELATION-PREDICATE 


RELATION 


PREDICATE-ADJ 

ADJECTIVE 

I 

SINGULAR 


Fig.  17.  Representation  of  interpretation  of  utterance 


As  Stated  in  Section  1.5,  the  NL  system  distinguishes  between  a  word  or  phrase 
and  its  interpretation.  The  interpretation  of  a  LITERAL  is  the  word  following  the 
quote  mark  (the  word  ’GOOSE,  in  this  case).  Thus  node  M80  represents  the  prop¬ 
osition  that  singular  number  is  a  property  of  the  word  ’GOOSE  and  not  of  the 
concept  expressed  by  the  word  ’GOOSE.  On  the  other  hand,  the  interpretation  of 
the  word  ’GRADY  is  represented  by  node  B20  of  Figure  15,  and  it  is  this  entity 
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that  has  color  white.  Comparing  these  two  examples  illustrates  the  knowledge  rep¬ 
resentations  we  have  established  as  well  as  the  capability  for  handling  strings  and 
their  interpretations  as  domain  knowledge,  which  is  fundamental  to  our  theory 
and  system. 

At  this  stage,  the  teacher  can  simplify  the  language  to  use  with  the  system  for 
expressing  properties.  She  does  this  by  inputting  the  following  rewrite  rules  so  that 
property  class  entries  can  be  made. 

PROPERTY-CLASS-ENTRY  —  ADJECTIVE  ’IS  ’A  PROPERTY 

PROPERTY-CLASS-ENTRY  ::  MEMBER  ADJECTIVE 

PROPERTY-CLASS  PROPERTY 

Since  the  system  has  previously  been  informed  that  ’WHITE  is  an  ADJECTIVE 
and  ’COLOR  is  a  PROPERTY,  the  utterance  “WHITE  IS  A  COLOR’’  would  be 
recognized  by  the  system  as  a  PROPERTY-CLASS-ENTRY.  Also  since,  in  Sec¬ 
tion  2.5  J,  for  the  purposes  of  this  example,  the  teacher  entered  ’ADJECTIVE  and 
’PROPERTY  into  UNIQUE-MEANING-CAT,  the  surface  strings  that  are  in  the 
categories  ADJECTIVE  and  PROPERTY  are  each  treated  as  having  a  unique 
interpretation.  Thus  different  instances  of  the  same  string  such  as  ’WHITE  are 
treated  by  the  system  as  having  the  same  interpretation  and  it  uses  just  one  net¬ 
work  structure  to  represent  this  interpretation.  Therefore,  using  the  above  seman¬ 
tic  rewrite  rule,  the  system  builds  the  structure  of  node  MSS  of  Figure  18  to  repre¬ 
sent  the  interpretation  of  the  utterance,  finding  the  nodes  B21  and  B22  of 
Figure  15  to  represent  the  interpretation  of  ’COLOR  and  ’WHITE,  respectively. 

Similarly,  the  system  can  be  informed  that  “PLURAL  IS  A  NUMBER”  and  it 
builds  a  structure  similar  to  that  of  Hgure  18  to  represent  the  assertion  that  the 
concept  expressed  by  ’PLURAL  is  a  member  of  the  property-class  NUMBER. 

If  the  utterance  “GLADYS  IS  WHITE”  is  now  input  to  the  system,  the  utter¬ 
ance  is  also  recognized  as  a  PROPERTY-CLAUSE  as  shown  in  Rgure  19. 

The  semantic  rewrite  rules  of  the  previous  section  are  used  by  the  system  to 
build  the  structure  dominated  by  node  M90  as  the  representation  of  the  interpreta¬ 
tion  of  the  utterance. 

According  to  the  semantic  rule  for  a  PROPERTY-CLAUSE,  the  PROPERTY 
slot  in  the  case  frame  is  filled  by  the  interpretation  of  the  PROPERTY-INDICA¬ 
TOR  constituent  of  the  utterance.  Referring  to  the  parse  tree  of  Figure  19,  the 
PROPERTY-INDICATOR  consists  of  the  PROPERTY-CLASS-INDICATOR. 
The  semantic  rule 

PROPERTY-INDICATOR: :  PROPERTY-CLASS-INDICATOR 

of  the  previous  section  instructs  the  system  to  use  the  interpretation  of  the  PROP¬ 
ERTY-CLASS-INDICATOR  as  the  interpretation  of  the  PROPERTY-INDICA¬ 
TOR.  Vhe  rule  for  interpreting  a  PROPERTY-CLASS-INDICATOR  is  the 
CASE-SLOT-DEFINITION  presented  in  the  previous  section; 

PROPERTY-CLASS-INDICATOR  >  >  [MEMBER  ADJECTIVE  PROP¬ 
ERTY-CLASS  -t-  ] 

This  rule  instructs  the  system  to  interpret  the  PROPERTY-CLASS-INDICATOR 
as  the  PROPERTY-CLASS  slot-filler  of  the  frame  whose  MEMBER  SLOT  is 
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Fig.  18.  Representation  of  interpretation  of  utterance 


.  PROPERTY-CLAOSE 


Fig.  19.  Parse  tree  for  input  uttetanoe 


filled  by  WHITE.  Node  B22  of  Figures  15  and  18  is  found  as  the  representation  of 
the  interpretation  of  ’WHITE,  since  the  members  of  the  class  ADJECTIVE  have 
been  defined  by  the  teacher  as  having  '^unique  semantics”.  Thus,  the  system  uses 
node  B22  to  use  as  the  MEMBER  slot-filler  for  the  case  frame  associated  with  a 
PROPERTY-CLASS-INDICATOR.  Then  B21  of  Figure  15  is  found  and  used  as 
the  PROPERTY-CLASS  slot-filler  as  shown  in  Figure  20,  since  it  represents  the 
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PROPERTY-CLASS  that  has  WHITE  as  a  MEMBER.  Node  B21  is  also  the  rep¬ 
resentation  of  the  interpretation  of  the  PROPERTY-CLASS-INDICATOR  string. 
In  general,  a  CASE-SLOT-DEFINITION  maps  a  surface  string  to  a  participant  in 
a  relation  or  proposition. 

In  a  manner  similar  to  the  parsing  and  interpretation  of  the  utterance 
“GLADYS  IS  WHITE”,  the  system  also  understands  the  utterance  “‘GEESE  IS 
PLURAL”.  The  system's  language  definition  is  again  used  as  a  metalanguage  to 
expand  upon  the  same  language  itself. 


3  Increasing  the  System^s  Language  Capability  Through 
Its  Language  Capability 

3.1  Motivation 

Since  we  treat  linguistic  knowledge  as  domain  knowledge,  the  system  teacher 
(user)  can  add  to  the  knowledge  base  and  instruct  the  system  as  to  how  to  process 
or  understand  ever  more  sophisticated  language. 

Just  as  a  person  is  continually  influenced  by  interaction  with  his  environment, 
the  data  base  of  our  system  is  modified  by  each  input  The  knowledge  base  is 
incrementally  enhanced  to  form  a  more  sophisticated  system. 

Since  we  represent  language  processing  knowledge  in  the  same  knowledge  base 
and  in  the  same  formalism  as  other  domain  knowledge,  it  is  possible  to  make  the 
system’s  language  processing  knowledge  the  subject  of  its  language  processing 
and  this  is  a  fundamental  aspea  of  our  approach.  Thus  by  instructing  the  system 
in  the  domain  of  linguistics  as  we  would  expect  to  be  able  to  do  with  another 
domain  in  an  interactive  NLU  system,  we  can  increase  the  system’s  language 
capability  through  its  language  capability.  A  user  can  communicate  with  our  sys¬ 
tem  in  Just  one  language  via  one  processor  without  switching  “modes”  or  interact¬ 
ing  with  supportive  processors  in  special  purpose  languages. 

The  rewrite  rules  of  the  KL  are  certainly  not  sufficient  for  expressing  all  the 
rules  a  teacher  would  need  to  define  a  language  of  her  choice.  Therefore,  one  of 
the  most  important  capabilities  that  the  system  needs  is  to  understand  a  more  gen¬ 
eral  form  of  rule  statement.  A  teacher  should  be  able  to  bootstrap  into  a  more 
powerful  rule  statement  language  from  the  KL.  In  the  next  sections,  we  present  an 
example  from  such  a  bootstrap  process. 


2.2  Defining  More-General  Rule  Forms 

The  teacher  first  extends  the  system’s  language  definition  so  that  it  can  begin  to 
understand  general  “IF-THEN”  rules.  As  stated  in  Section  2.2.1,  RULE-STMT  is 
a  predefined  category.  The  syntax  of  a  RULE-STMT  is  not  predefined,  but  for  the 
interpretation  process,  each  RULE-STMT  must  have  an  ANT-CLAUSE  and  a 
CQ-CLAUSE  constituent.  The  ANT-CLAUSE  constituent  is  interpreted  as  the 
antecedent  of  the  rule  and  the  CQ-CLAUSE  constituent  as  the  consequent  of  the 
rule.  Thus  the  rewrite  rule  on  to 


RULE-STMT  —  TF  ANT-CLAUSE  THEN  CQ-CLAUSE 

defines  a  syntax  for  the  RULE-STMT.  The  syntax  and  semantics  of  ANT- 
CLAUSE  and  CQ-CLAUSE  must  also  be  defined.  This  was  done  is  Section  2.5 
(see  Appendix). 

The  additional  rules  that  the  teacher  chooses  to  input  to  the  system  to  increase 
its  capability  of  understanding  linguistic-domain  language  for  this  example  are 
listed  below.  In  order  to  make  use  of  the  system's  ability  to  use  a  VARIABLE  as 
an  appositive  to  another  phrase  and  remember  the  association  of  the  VARIABLE 
to  the  prase,  the  teacher  inputs: 

DEF-S-PHRASE  —  DEF-DET  S-CAT 
INDEF-S-PHRASE  —  INDEF-DET  S-CAT 
MAIN-APPOS-PHR  VAR-NAME  —  INDEF-S-PHRASE  VARIABLE 
VAR-APPOSmON-PHR—  MAIN-APPOS-PHR  VAR-NAME 

To  explain  to  the  system  how  to  parse  and  interpret  language  which  describes  one 
phrase  being  a  constituent  of  another,  the  teacher  inputs: 

SUP-STRING-REF  —  VAR-APPOSITION-PHR 
CONSTIT-REF  —  DEF-S-PHRASE 
CONSnr-PHRASE—  CONSTIT-REF  'OF  SUP-STRING-REF 
NOUN-PHRASE  —  CONSTIT-PHRASE 

CONS'nT-PHRASE>>[CONSTrr  +  CONSTITOF  SUP-STRING- 
REF] 

[BSTR  STRING  CAT  DEF-S-PHRASE  STRC  +] 
SUP-STRING-REF  ::  VAR-APPOSITION-PHR 
MAIN-APPOS-PHR : :  INDEF-S-PHRASE 
NOUN-PHRASE  ::  CONSTIT-PHRASE 
DEF-S-PHRASE  : :  S-CAT 

These  rules  will  be  used  in  the  next  sections. 


3.3  Parsing  Strategy 

The  parsing  strategy  applied  by  our  NL  system  is  a  combined  bottom-up,  top- 
down  strategy.  As  each  word  of  an  input  string  is  read  by  the  system,  the  network 
representation  of  the  string  is  extended  as  discussed  in  Section  2.3  and  relevant 
rules  stored  in  the  SNePS  network  are  triggered.  All  applicable  rules  are  started  in 
parallel  in  the  form  of  processes  created  by  our  MULTI-processing  package 
(McKay  and  Shapiro,  1980).  These  processes  are  suspended  if  not  all  their  anteced¬ 
ents  are  satisfied  and  are  resumed  if  more  antecedents  are  satisfied  as  the  reading 
of  the  string  proceeds.  As  parsing  proceeds,  the  annotated  parse  (tree(s)  for  an 
input  utterance  is  (are)  represented  in  the  system's  network  knowledge  base.  Our 
system  builds  and  retains  network  structures  corresponding  to  alternative  analyses 
of  a  given  input  string.  Retention  of  the  alternatives  avoids  the  reanalysis  of  previ¬ 
ously  processed  surface  strings  that  occurs  in  a  backtracking  system. 

Processing  is  controlled  by  the  SNePS  Inference  Package  (Shapiro  et  al.,  1982), 
which  employs  bi-directional  inference.  This  is  a  form  of  inference  resulting  from 
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interaction  between  forward  and  backward  inference  and  loosely  corresponds  to 
bi-directional  search  through  a  space  of  inference  rules.  This  technique  focuses 
attention  towards  the  active  parsing  processes  and  prunes  the  search  through  the 
space  of  inference  rules  by  ignoring  rules  which  have  not  been  activated.  This  cuts 
down  the  fan  out  of  pure  forward  or  backward  chaining.  New  rules  are  activated 
only  if  no  active  rules  are  applicable. 

Consider  the  sample  input  utterance  “IF  THE  HEAD-NOUN  OF  A  NOUN¬ 
PHRASE  X  HAS  NUMBER  Y  THEN  X  HAS  NUMBER  Y“.  When  the  first 
word  is  read  by  the  system,  it  is  recognized  as  matching  the  word  TF  in  the  rules 

RULE-STMT  —  TF  ANT-CLAUSE  THEN  CQ-CLAUSE 
TF  ANT-CLAUSE  —  TF  PROPERTY-CLAUSE 

and  parsing  begins  in  a  bottom-up  manner.  Both  rules  are  triggered  in  parallel  by 
the  SNePS  MULTI  package.  When  originally  input,  each  of  the  above  rules  was 
interpreted  by  the  system  and  stored  in  the  form  of  a  netwoiic  rule  which  we  para¬ 
phrase  as  follows  (NOTE:  In  all  the  paraphrased  rules  of  this  section,  Vt  and  Vj 
are  universally  quantified  variables): 

(1)  If  a  word  of  an  input  string  is  the  word  TF,  then 

(2)  if  V,  f-llows  the  word  TF  and  Vi  is  an  ANT-CLAUSE,  then 

(3)  if  the  word  THEN  follows  Vj,  then 

(4)  if  V2  follows  the  word  THEN  and  V2  is  a  CQ-CLAUSE, 

then  the  string  consisting  of  TF  followed  by  V, 
foUowed  by  THEN  followed  by  V2  is  a  RULE-STMT. 

(5)  If  a  word  of  an  input  string  is  the  word  TF,  then 

(6)  if  V,  follows  the  word  TF  and  V,  is  a  PROPERTY-CLAUSE, 

then  V,  is  an  ANT-CLAUSE. 

(The  numbers  in  parentheses  are  rule  numbers,  not  line  numbers.  Thus,  for  exam¬ 
ple,  nested  rule  (3)  begins  with  “if  the  word  THEN”  and  continues  to  the  period 
a^the  end  of  the  sentence.) 

Since  the  antecedent  of  rule  (1)  above  is  satisfied,  the  system  questions  whether 
a  string  immediately  following  the  word  TF  is  an  ANT-CLAUSE.  When  a  SNePS 
rule  is  triggered,  a  process  is  created  forming  the  active  version  of  the  rule  for  the 
purpose  of  such  activities  as  data  collection  and  variable  binding.  Some  of  these 
processes  act  as  demons,  waiting  for  instances  of  their  antecedents  so  that 
instances  of  their  consequents  can  be  deduced.  This  is  the  case  for  the  nested  rule 
(2).  Since  no  string  follows  the  word  TF  yet,  the  process  for  rule  (2)  is  suspended. 

These  active  processes,  with  their  communication  links,  form  the  equivalent  of  a 
hypothesized  parse  tree  with  associated  expectations.  The  inference  system 
ignores  unactivated  rules  as  long  as  there  are  applicable  active  rule  processes 
avv<uung  uattt,  «»scuu«Jly  parsing  in  a  top-down  manner  in  this  situation.  The 
hypothesized  parse  tree  corresponding  to  the  process  of  rule  (2)  is  illustrated  in 
Figure  21.  The  parsing  strategy  of  our  system  is  similar  to  “left-comer  bottom-up 
parsing”  (Burge,  1975)  in  that  construction  of  a  parse  tree  begins  at  the  bottom  left 
comer,  processing  of  a  surface  string  proceeds  in  a  left-to-right  manner,  and  when¬ 
ever  an  initial  segment  of  a  string  has  been  parsed,  the  system  attempts  to  establish 
a  goal  analysis  of  the  string  or  substring  thereof.  In  the  following  figures,  the  bro- 
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IF 

Fig.  21.  Hypothesized  parse  tree 


Fig.  22. 
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•IF  ANT-CLAUSE  7 
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Hypothesized  parse  tree 


CQ-CLAOSE  7 


ken  lines  indicate  goals  or  expectations  represented  by  antecedents  of  nested  rules 
for  which  active  demons  have  not  yet  been  created.  The  question-marks  indicate 
expectations  which  have  not  yet  been  satisfied. 

The  antecedent  of  rule  (5)  is  also  satisfied.  This  is  a  context  sensitive  rule  which 
constrains  the  parsing  process.  According  to  this  rule,  a  PROPERTY-CLAUSE  is 
parsed  as  an  ANT-CLAUSE  in  the  context  of  the  word  TF.  A  process  is  created 
forming  the  active  version  of  rule  (6)  and  this  process  awaits  a  PROPERTY- 
CLAUSE  following  the  word  TF.  Figure  22  reflects  the  current  state  of  the  system 
in  terms  of  its  active  processes,  implicit  expectations,  and  the  tokens  that  it  has 
consumed. 

When  the  word  THE  is  read  by  the  system,  the  rule 
DEF-S-PHRASE  -*  DEF-DET  S-CAT 

is  triggered  as  parsing  continues  in  a  bottom-up  manner.  This  rule  is  paraphrased 
as: 

(7)  If  V,  is  a  DEF-DET,  then 

(8)  if  V2  follows  Vi  and  V2  is  an  S-CAT, 

then  the  string  consisting  of  V,  followed  by  V2  is  a 
DEF-S-PHRASE. 

The  antecedent  of  rule  (7)  is  satisfied  and  a  process  is  created  for  nested  rule  (8)  to 
await  an  S-CAT  following  the  DEF-DET.  The  active  processes  form  another 
hypothesized  parse  tree  shown  in  Rgure  23. 
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Fte.23.  Hypothesized  parse  tree 
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Fi{.24.  Hypothesized  parse  trees 


Suppose  another  rule  such  as  DEF-NOUN-PHRASE  — ►  DEF-DET  NOUN 
had  been  entered  by  the  teacher  and  is  present  in  the  network  knowledge  base. 
This  rule  is  paraphrased  as: 

(9)  IfV,  is  a  DEF-DET,  then 

(10)  if  V2  follows  Vi  and  Vj  is  a  NOUN,  then  the  string 

consisting  of  V,  followed  by  Vj  is  a  DEF-NOUN-PHRASE. 

This  latter  rule  is  also  triggered  by  the  system’s  reading  of  the  word  THE  and  the 
processes  created  for  rules  (9)  and  (10)  form  another  set  of  hypothesized  parse 
trees  as  illustrated  in  Rgure  24. 
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The  parse  trees  of  Figure  24  dominated  by  DEF-S-PHRASE?  and  DEF- 
NOUN-PHRASE?  represent  alternative  possibilities  for  the  parse  of  the.  string 
beginning  with  the  word  THE  A  process  such  as  the  process  for  rule  (10)  waiting 
for  a  NOUN  may  remain  suspended  indefinitely  if  the  expected  data  is  not  forth¬ 
coming. 

When  the  next  word  ’HEAD-NOUN  is  read,  the  system  recognizes  it  as  an  S- 
CAT  and  the  process  corresponding  to  rule  (8)  is  resumed  since  it  is  waiting  for  an 
S-CAT  following  the  word  ’THE.  Thus  the  string  “THE  HEAD-NOUN’’  is  recog¬ 
nized  as  a  DEF-S-PHRASE  by  application  of  the  teacher’s  rules.  This  DEF-S- 
PHRASE  then  triggers  the  network  version  of  the  following  rule  and  the  DEF-S- 
PHRASE  is  then  recognized  as  a  CONSTIT-REF: 

CONSTIT-REF  ^  DEF-S-PHRASE 

Recognition  of  a  CONSTIT-REF  triggers  the  rule 

CONSTIT-PHRASE  —  CONSTIT-REF  ’OF  SUP-STRING- REF 

whose  network  representation  can  be  paraphrased  as 

(11)  If  V,  is  a  CONSTIT-REF.  then 

(12)  if  the  word ’OF  follows  Vj,  then 

(13)  if  V2  follows  the  word  ’OF  and  Vj  is  a  SUP-STRING-REF, 

then  the  string  consisting  of  Vj  followed  by  the  word 
’OF  followed  by  Vj  is  a  CONSTIT-PHRASE 

Activation  of  rule  (11)  is  analogous  to  bottom-up  processing  again.  A  process  is 
established  for  rule  (12)  to  await  the  word  ’OF  in  the  input  stream. 

When  the  next  word  ’OF  is  read  by  the  system,  the  demon  corresponding  to  rule 
(12)  is  activated  and  since  the  antecedent  of  rule  (12)  is  satisfied,  a  process  is  estab¬ 
lished  for  rule  (13)  to  expect  a  SUP-STRING-REF  following  the  word  ’OF.  No 
other  rules  are  activated  by  the  reading  of  the  word  ’OF  since  an  active  process 
was  waiting  for  this  word  in  the  input  stream. 

The  system  parses  the  next  string  “A  NOUN-PHRASE”  as  an  INDEF-S- 
PHRASE  by  application  of  the  rule 

INDEF-S-PHRASE  INDEF-DET  S-CAT 

This  triggers  the  rule 

MAIN-APPOS-PHR  VAR-NAME  —  INDEF-S-PHRASE  VARIABLE 
which  is  paraphrased  as 

(14)  IfV,  isan  INDEF-S-PHRASE, then 

(15)  if  V2  follows  V,  and  V2  is  a  VARIABLE, 

(16)  then  V,  is  a  MAIN-APPOS-PHR  and  V2  is  a  VAR-NAME. 

Since  the  antecedent  of  rule  (14)  is  satisfied,  a  process  is  set  up  for  rule  (15).  When 
the  next  word  ’X  is  read,  it  is  recognized  as  a  VARIABLE  and  since  the  active 
process  for  rule  (15)  is  waiting  for  a  VARIABLE,  no  unactivated  rules  are  applied. 
An  example  of  such  an  unactivated  rule  is 
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NOUN-PHRASE  —  VARIABLE 

which  we  previously  input  to  the  system.  Thus  an  alternative  parse  is  blocked  by 
the  expectation  of  a  VARIABLE  by  the  process  for  rule  (15).  By  application  of  the 
rules 

VAR-APPOSmON-PHR  ^  MAIN-APPOS-PHR  VAR-NAME 
SUP-STRING-REF  —  VAR-APPOSITION-PHR 

the  expected  SUP-STRING-REF  of  rule  (13)  is  satisfied  aind  the  string  “THE 
HEAD-NOUN  OF  A  NOUN-PHRASE  X”  is  parsed  as  a  CONSTIT-PHRASE. 
By  application  of  the  rule 

NOUN-PHRASE  —  CONSTIT-PHRASE 

the  string  is  also  recognized  as  a  NOUN-PHRASE.  Notice  that  the  term  NOUN¬ 
PHRASE  is  mentioned  m  the  input  string  and  used  in  the  application  of  the  above 
rule. 

At  this  point  in  the  parsing  process  the  hypothesized  parse  trees  are  illustrated 
in  Figure  25. 

As  parsing  proceeds  using  the  rules  introduced  in  this  and  preceding  sections  of 
this  chapter,  the  resulting  parse  of  the  entire  input  statement  is  shown  in  Figure  26. 
The  string  category  identifiers  in  the  tree  that  are  underlined  are  the  categories 
that  are  included  in  the  domain  of  the  representational  mapping.  These  are  the 
categories  for  which  the  teacher  has  defined  a  rule  to  determine  the  interpretation 
of  any  member  of  the  category  (i.e.,  the  underlining  identifies  the  string  categories 
defined  by  the  teacher  as  the  conceptually  coherent  constituents  of  the  utter¬ 
ance). 


ROLE-STMT  ? 
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In  this  section  on  parsing,  we  have  illustrated  the  following  characteristics  of  our 
system’s  strategy: 

(1)  the  parallel  processing  of  applicable  rules; 

(2)  constraint  of  the  parsing  process  by  the  use  of  context  sensitive  rules; 

(3)  constraint  of  the  parsing  process  by  the  SNePS  Inference  Package  focusing  on 
active  rule  processes  -  the  manifestation  being  the  blocking  of  multiple  parses 
by  previously  established  expectations; 

(4)  suspension  and  resumption  of  rule  processes  during  the  parsing  process. 

The  retention  of  alternative  analyses  of  a  string,  which  avoids  the  reanalysis  of  cer¬ 
tain  strings  in  the  case  of  a  backtracking  system,  was  not  illustrated  by  the  exam¬ 
ple  of  this  section,  but  is  a  characteristic  of  our  system. 

Also  of  importance  in  this  section  is  the  fact  that  the  system  is  again  using  its 
acquired  language  definition  as  a  metalanguage  to  understand  another  instruction 
from  the  teacher  concerning  the  language  itself. 


3.4  Interpretation  of  the  Input  Rule  Statement 

During  the  interpretation  process,  a  VARIABLE  of  the  user’s  language  is  trans¬ 
lated  into  a  variable  node  of  the  semantic  network.  The  scope  of  a  user  VARI¬ 
ABLE  is  the  utterance  in  which  it  occurs.  The  association  of  a  user  VARIABLE  to 
its  interpretation  is  maintained  on  a  list  only  during  translation  of  the  utterance  in 
which  the  VARIABLE  occurs. 

The  interpretation  of  a  user  VARIABLE  is  as  follows:  If  a  VARIABLE  is  used 
as  the  VAR-NAME  of  a  VAR-APPOSITION-PHR,  discussed  briefly  in  Sec¬ 
tion  2.2.1,  then  the  system  uses  the  interpretation  of  the  MAIN-APPOS-PHR  as 
the  interpretation  of  the  VARIABLE,  and  stores  this  association  on  the  variable 
association  list.  Otherwise,  the  system  checks  the  variable  association  list  for  a 
corresponding  interpretation  already  established.  Otherwise,  a  new  variable  node 
is  created  as  the  interpretation  of  the  user  VARIABLE,  the  new  pair  once  again 
being  added  tc  the  variable  association  list 

As  shown  in  Figure 25,  the  phrase  “A  NOUN-PHRASE  X”  was  recognized  by 
the  system  as  a  VAR-APPOSITION-PHR,  with  “A  NOUN-PHRASE”  recognized 
as  the  MAIN-APPOS-PHR  and  ’X  as  the  VAR-NAME.  Thus  the  interpretation  of 
the  phrase  “A  NOUN-PHRASE”  is  remembered  by  the  system  as  the  interpreta¬ 
tion  of ’X.  The  string  ‘‘A  NOUN-PHRASE”  has  been  recognized  as  an  INDEF-S- 
PHRASE  and  thus  the  semantic  rule 

INDEF-S-PHRASE  >  >  [BSTR  STRING  S-CAT  STRC  -r] 

of  Section  2.5.3  applies.  As  discussed  in  Section  2.5.3,  if  a  constituent  is  mentioned 
in  a  semantic  rule  but  is  missing  from  the  surface  string  to  which  the  rule  applies, 
then  the  system  represents  the  interpretation  of  the  constituent  as  an  atomic  node. 
Furthermore,  this  atomic  node  is  a  variable  node  in  the  context  of  a  RULE-STMT. 
Since  the  slot-filler  constituent  of  category  STRING  is  not  present  in  our  example 
INDEF-S-PHRASE,  an  atomic  variable  node  (V2  of  Fig. 26)  is  built  to  represent 
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the  interpretation  of  the  missing  STRING  constituent  The  representation  of  the 
interpretation  of  the  S-CAT  constituent  “NOUN-PHRASE”  is  node  B25,  repre¬ 
senting  the  category  of  NOUN-PHRASEs.  The  STRC  slot-filler  becomes  the 
interpretation  of  the  INDEF-S-PHRASE.  This  slot-filler  is  also  represented  by  an 
atomic  variable  node  (VI  of  Fig.  26)  as  explained  in  Sect  2.5  J.  The  +  symbol  in 
the  rewrite  rule  marks  the  participant  of  the  proposition  represented  by  the  case 
frame  whose  representation  is  also  the  representation  of  the  interpretation  of  the 
INDEF-S-PHRASE.  Thus  the  interpretation  of  the  INDEF-S-PHRASE  “A 
NOUN-PHRASE”  is  node  VI  of  Figure  26.  That  is.  the  INDEF-S-PHRASE  is 
interpreted  as  a  variable  node  to  be  instantiated  by  a  structure  representing  an 
analyzed  surface  string  which  has  an  associated  bounded-string  (see  Sect.  2.2.3) 
and  category  NOUN-PHRASE.  VI  is  also  the  interpretation  of  user  VARIABLE 
’X  due  to  the  string  “A  NOUN-PHRASE  X”  being  a  VAR-APPOSITION-PHR 
and  the  association  of  VI  and  ’X  is  stored  on  the  variable  association  list. 

The  input  string  “THE  HEAD-NOUN  OF  A  NOUN-PHRASE  X”  was  parsed 
as  a  CONSTIT-PHRASE  (refer  to  Fig.  26).  The  rule  for  interpreting  a  CONSTIT- 
PHRASE  was  given  in  Seaion  3.3  as 

CONSTIT-PHRASE  >  >  [CONSTIT  -I-  CONSTITOF  SUP-STRING- 

REF] 

(BSTR  STRING  CAT  DEF-S-PHRASE  STRC 

+I 

This  rule  stipulates  that  the  interpretation  of  a  CONSTIT-PHRASE  is  a  partici¬ 
pant  in  two  case  frames  as  defined  in  the  two  sets  of  brackets  and  the  -f  symbol 
marks  the  slot-filler  that  is  the  interpretation  of  the  CONSTIT-PHRASE.  Again  an 
atomic  variable  node  is  built  to  represent  this  slot-filler  which  also  represents  the 
CONSTIT-PHRASE.  The  SUP-STRING- REF  ist  the  constituent  “A  NOUN¬ 
PHRASE  X”  (refer  to  Fig.  26),  whose  interpretation  is  represented  by  node  VI  of 
Figure  27.  The  structure  representing  the  case  frame  defined  in  the  second  set  of 
brackets  is  built  in  a  manner  similar  to  that  used  in  building  the  struaure  of  Fig¬ 
ure  27  and  dwcribed  above. 

The  interpretation  of  the  example  CONSTIT-PHRASE  “THE  HEAD-NOUN 
OF  A  NOUN-PHRASE  X”  is  represented  by  node  V3  of  Figure  28.  The  node  VI 
of  Figure  28  ist  the  same  node  as  VI  of  Figure  27. 

Assembling  the  interpretations  of  the  constituents  of  our  RULE-STMT  from 
Figures  27  and  28  and  completing  the  interpretation  of  the  RULE-STMT  as  the 
system  does  using  the  semantic  rewrite  rules  of  this  article,  node  M86  of  Figure  29 
represents  the  interpretation  of  the  RULE-STMT.  All  of  the  variable  nodes  VI,  V2, 
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Fig.28.  Node  V3  represents  the  interpretation  of  the  CONSTIT-PHRASE 
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Fig.  29.  Node  M86  represents  the  interpretation  of  the  input  rule 


V3,  V4,  and  V5  are  universally  quantified  (refer  to  Shapiro  (1979a)  for  the  network 
representation  of  the  quantification  that  is  not  shown  in  the  figure  and  for  more 
details  on  the  rule  structures  of  SNePS).  The  &ANT  and  CQ  arcs  are  the  SNePS 
system  arcs  used  in  the  network  representation  of  “&-entailment”,  the  entailment 
of  any  of  a  set  of  consequents  by  the  conjunction  of  one  or  more  antecedents. 


4  Language  Use-Mention  Distinction 

In  order  for  our  system  to  treat  linguistic  knowledge  as  domain  knowledge  and  to 
receive  instruction  in  the  use  of  this  knowledge,  it  is  essential  for  the  system  to  dis¬ 
tinguish  between  use  and  mention  of  language  (Quine,  1951).  We  have  already 
seen  examples  of  this  capability  in  our  system.  When  words  are  entered  into  their 
appropriate  lexical  categories  as  in  Section  2.5.2  a,  they  are  mentioned.  Those  lex¬ 
emes  that  are  themselves  names  of  lexical  categories  are  subsequently  used  to  refer 
to  their  corresponding  lexical  categories.  For  example,  the  word  ’^RB  is  men¬ 
tioned  when  entered  into  the  category  L-CAT  of  lexical  category  names  and 
is  subsequently  used  to  refer  to  the  category  of  verbs  (see  Sect  2.5.2).  The 
word  ’GOOSE  is  mentioned  in  the  example  sentence  of  Section  2.6  when  specify¬ 
ing  that  its  number  is  singular,  but,  in  a  similar  sentence,  the  word  ’GRADY  is 
used. 

As  a  more  sophisticated  example  combining  use  and  mention,  we  illustrate  ouf 
system’s  processing  of  an  equivalent  version  of  the  classic  sentence  of  Tarski 
(1944)  “‘SNOW  IS  WHITE’  IS  .TRUE  IF  AND  ONLY  IF  SNOW  IS  WHITE”. 
We  do  not  treat  truth  relative  to  possible  worlds.  Our  semantic  network  represents 
only  the  belief  space  of  the  system,  and  asserted  propositions  are  those  believed  by 
the  system. 

We  continue  to  build  upon  the  language  definition  thus  far  input  to  the  system 
in  this  article.  The  additional  lexical  entries  that  we  input  are: 

L-CAT  —  ’MASS-NOUN  ADJECTIVE  —  ’TRUE 
PROPERTY  TRUTH-VALUE  ADJECTIVE  —  ’FALSE 
MASS-NOUN  —  ’SNOW 

We  explain  to  the  system  that 

TRUE  IS  A  TRUTH-VALUE 
FALSE  IS  A  TRUTH- VALUE 

to  be  parsed  and  interpreted  by  the  system  as  PROPERTY-CLASS-ENTRIES  as 
shown  in  Section  2.6.  Additional  syntax  rules  such  as  the  following  are  needed: 

NOUN-PHRASE  ^  MASS-NOUN 
NOUN-PHRASE  —  LITERAL-STRING 

Upon  input  of  the  sentence 

If  SNOW  IS  WHITE  THEN  “SNOW  IS  WHITE"  IS  TRUE 

the  system  builds  the  parse  tree  shown  in  Figure  30  for  the  utterance. 

Applying  the  teacher’s  rules,  the  system  builds  the  network  rule  of  Figure  31  as 
the  interpretation  of  the  input  sentence.  Node  M92  represents  the  generic  string 
“SNOW  IS  WHITE”  and  not  just  an  instance  of  the  string.  Node  M92  dominates 
a  pattern  that  is  matched  by  any  instance  of  the  string,  with  V8  a  universally 
quantified  variable  node. 

If  the  system  believes  that  snow  is  white,  then  the  rule  shown  in  Figure  31  is 
used  appropriately  and  if  we  query  the  system  regarding  any  instance  of  the  string 
“SNOW  IS  WHITE”  it  indicates  that  the  string  is  true. 
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To  complete  the  original  bi-conditional  statement,  the  converse  statement 

IF  “SNOW  IS  WHITE”  IS  TRUE  THEN  SNOW  IS  WHITE 

can  also  be  entered  the  system  and  the  converse  of  the  rule  of  Figure  31  is  built 
into  the  network  as  its  interpretation. 


5  Summary 

This  article  has  presented  our  approach  to  NLU:  an  approach  that  focuses  on  the 
capability  of  a  natural  langauge  to  be  used  as  its  own  metalanguage.  It  is  essential 
to  this  approach  to  have  the  system’s  parsing  and  linguistic  knowledge  be  an  inte¬ 
gral  part  of  its  domain  knowledge.  It  is  our  view  that  linguistic  knowledge  about  a 
word  or  phrase  is  a  part  of  its  meaning  or  significance  and,  furthermore,  there  is 
no  clear  boundary  line  separating  syntaaic,  semantic,  and  world  knowledge.  For 
these  reasons  we  represent  linguistic  knowledge  along  with  other  domain  knowl¬ 
edge  in  an  integrated  knowledge  base.  Furthermore,  the  linguistic  rules  of  the  sys¬ 
tem’s  knowledge  base  comprise  the  system’s  knowledge  of  language  understand¬ 
ing  in  the  same  way  that  the  rules  of  any  rule-based  system  comprise  that  system’s 
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Fig.31.  Interpretation  of  the  input  utterance  “IF  SNOW  IS  WHITE  THEN  ‘SNOW  IS  WHITE’ 
IS  TRUE“ 


knowledge  of  its  domain  of  application.  Our  system  also  incorporates  the  use- 
mention  distinction  for  language. 

We  are  exploring  the  possibility  of  a  NLU  system’s  becoming  more  adept  in  its 
use  of  some  language  by  being  instructed  in  the  use  of  the  language.  We  wish  this 
explanation  to  be  given  in  an  increasingly  sophisticated  subset  of  the  language 
being  taught.  The  system  must  start  with  some  language  facility,  and  we  are  inter¬ 
ested  in  seeing  how  small  and  theory-independent  we  can  make  the  initial  kernel 
language. 

In  this  chapter,  we  have  discussed  the  core  knowledge  and  representations  of 
our  system,  including  the  kernel  language,  which  consists  of  predefined  terms,  and 
syntactic  and  semantic  rewrite  rules  with  which  to  bootstrap  into  a  more  sophisti¬ 
cated  language  definition.  We  have  demonstrated  the  capability  of  increasing  the 
system’s  language  facility  by  using  the  very  same  facility  to  instruct  the  system 
about  language  understanding.  We  built  up  the  system’s  capability  to  the  stage  at 
which  it  processed  the  sentence  “IF  THE  HEAD-NOUN  OF  A  NOUN-PHRASE 
X  HAS  NUMBER  Y  THEN  X  HAS  NUMBER  Y“.  We  presented  additional 
examples  of  language  being  treated  as  the  topic  of  discourse  including  the  sys¬ 
tem’s  parsing  and  interpretation  of  the  sentence  “'SNOW  IS  WHITE’  IS  TRUE 
IF  AND  ONLY  IF  SNOW  IS  WHITE”. 
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We  discussed  the  system’s  parsing  strategy,  which  is  a  combined  bottom-up, 
top-down  strategy.  Our  system’s  parser  is  a  general  rule-based  inference  system  in 
which  applicable  rules  are  activated  in  parallel  in  the  form  of  processes  or 
demons.  The  inference  system  employs  bi-directional  inference  to  cut  down  the 
fan  out  of  pure  forward  or  backward  chaining. 
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Appendix  Chronological  Summary  of  Input  to  the  System 
as  Resented  in  This  Chapter 


Section  Input 
Number 


152  L-CAT  — 'NOUN 

2.5.2  L-CAT— ‘PROPER-NOUN 

152  LCAT^  ‘DEF-DET 

152  L-CAT—’INDEF-DET 

152  L-CAT-**VERB 

2.51  L-CAT  — ‘BE-VERB 

2.5.2  L-CAT— ‘ADVERB 

2.51  L-CAT — ‘ADJECTIVE 

152  LCAT— ‘PREPOSITION 

152  L-CAT— ‘CO  NJ 

2.51  L-CAT— ‘PROPERTY 

152  S-CAT— ‘HEAD->'OUN 

152  S-C  AT— ‘STRING 

152  VARIABLE  — ‘X 

152  VARIABLE  — ‘Y 

2.51  NOUN  — ‘GOOSE 

151  NOUN  — ‘GEESE 

2.51  PROPER-NOUN  —  ‘GRADY 

2.51  PROPER-NOUN  —  ‘GLADYS 

2.51  DEF-DET— ‘THE 

2.51  INDEF-DET— 'A 

2.51  VERB  — ‘HAS 

151  BE-VERB  — ‘IS 

151  ADVERB  — THEN 

151  ADJECTIVE  —  ‘WHITE 

2.52  ADJECTIVE  —  ‘SINGULAR 

151  ADJECTIVE— ‘PLURAL 

151  PREPOSITION  —  ‘OF 

2.51  CONJ  — ‘IF 

2.51  PROPERTY  —  COLOR 

151  PROPERTY— ‘NUMBER 

151  UNIQUE-MEANING-CAT— ‘ADJECTIVE 

2J1  UNIQUE-MEANING-CAT— ’PROPERTY 

2  5.2  PROPERTY-CLAUSE  —  SUBJECT  PREDICATE 
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Number 


2.5.2  SUBJECT—  NOUN-PHRASE 

2.5.2  NOUN-PHRASE— LITERAL 

2.5.2  NOUN-PHRASE— VARIABLE 

2.52  NOUN-PHRASE- PROPER-NOUN 

2.5.2  PREDICATE  —  RELATION-PREDICATE 

2.5.2  PREDICATE— BE-PREDICATE 

2.5.2  RELATION-PREDICATE— RELATION  PREDICATE-ADJ 

2.5.2  BE-PREDICATE— BE-VERB  PROPERTY-INDICATOR 

2.5.2  RELATION  —  ’HAS  PROPERTY-INDICATOR 

2.5.2  PROPERTY-  INDICATOR—  PROPERTY-CLASS-INDICATOR 

2.5.2  PROPERTY-INDICATOR— PROPERTY 

2J2  PROPERTY-CLASS-INDICATOR  —  PREDICATE-ADJ 

2.5.2  '  RULE-STMT— ’IF  ANT-CLAUSE  THEN  CQ-CLAUSE 

2.5.2  RELATION  PREDICATE-ADJ  —  RELATION  ADJECTIVE 

2J.2  RELATION  PREDICATE-ADJ  —  RELATION  VARIABLE 

15.2  BE-VERB  PREDICATE-ADJ  —  BE-VERB  ADJECTIVE 

15.2  ’IF  ANT-CLAUSE  —  ’IF  PROPERTY-CLAUSE 

15.2  THEN  CQ-CLAUSE  —  THEN  PROPERTY-CLAUSE 

15  J  PROPERTY-CLAUSE  : :  PROPERTY  SUBJECT 

PROPERTY  PROPERTY-INDICATOR 
VALUE  PREDICATE-ADJ 
2.5-3  ANT-CLAUSE  : :  PROPERTY-CLAUSE 

15-3  CQ-CLAUSE  : :  PROPERTY-CLAUSE 

15J  SUBJECT  : :  NOUN-PHRASE 

15J3  NOUN-PHRASE::  LITERAL 

15  J  NOUN-PHRASE : :  VARIABLE 

2.53  NOUN-PHRASE:  :  PROPER-NOUN 

2.53  PROPERTY-INDICATOR  : :  PROPERTY-CLASS-INDICATOR 

2.5.3  PROPERTY-INDICATOR  : :  PROPERTY 

233  PREDICATE-ADJ  :  ADJECTIVE 

2.53  PREDICATE-ADJ  : :  VARIABLE 

153  PROPERTY-CLASS-INDICATOR  >  > 

(MEMBER  ADJECTIVE  PROPERTY-CLASS  -(-] 

2.53  INDEF-S-PHRASE  >  >  (BSTR  STRING  CAT  S-CAT  STRC  -I-) 

16  GRADY  HAS  COLOR  WHITE 

2.6  ’GOOSE  HAS  NUMBER  SINGULAR 

2.6  PROPERTY-CLASS-ENTRY  —  ADJECTIVE  ’IS  ’A  PROPERTY 

2.6  PROPERTY-CLASS-ENTRY  ; :  MEMBER  ADJECTIVE 

PROPERTY-CLASS  PROPERTY 

2.6  WHITE  IS  A  COLOR 

16  PLURAL  IS  A  NUMBER 

2.6  GLADYS  IS  WHITE 

2.6  ’GEESE  IS  PLURAL 

3.2  DEF-S-PHRASE  —  DEF-DET  S-CAT 

33  INDEF-S-PHRASE  —  INDEF-DET  S-CAT 

3.2  MAIN-APPOS-PHR  VAR-NAME—  INDEF-S-PHRASE  VARIABLE 

3.2  VAR-APPOSITION-PHR~  MAIN-APPOS-PHR  VAR-NAME 

3.2  SUP-STRING-REF  —  VAR-APPOSITION-PHR 

3.2  CONSTIT-REF  —  DEF-S-PHRASE 

3.2  CONSTIT-PHRASE—  CONSTIT-REF  'OF  SUP-STRING-REF 

3  2  NOUN-PHRASE  —  CONSTIT-PHRASE 
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32  CONSTIT-PHRASE  >  >  (CONSTIT  +  CONSTITOF  SUP-STRING-REF] 

(BSTR  STRING  CAT  DEF-S-PHRASE  STRC  -H 

3.2  SUP-STRING-REF  : :  VAR-APPOSITION-PHR 

3.2  MAIN-APPOS-PHR::  INDEF-S-PHRASE 

3.2  NOUN-PHRASE  ::  CONSTIT-PHRASE 

3.2  DEF-S-PHRASE  ::S-CAT 

3J  IF  THE  HEAD-NOUN  OF  A  NOUN-PHRASE  X  HAS  NUMBER  Y 
THEN  X  HAS  NUMBER  Y 

4  L-CAT  —'MASS-NOUN 

4  PROPERTY  —  TRUTH- VALUE 

4  MASS-NOUN— 'SNOW 

4  adjective  —true 

4  ADJECTIVE  — 'FALSE 

4  TRUE  IS  A  TRUTH-VALUE 

4  FALSE  IS  A  TRUTH-VALUE 

4  NOUN-PHRASE— MASS-NOUN 

4  NOUN-PHRASE  —  LITERAL-STRING 

4  IF  SNOW  IS  WHITE  THEN  “SNOW  IS  WHITE"  IS  TRUE 

4  IF  “SNOW  IS  WHITE"  IS  TRUE  THEN  SNOW  IS  WHITE 
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ABSTRACT 

We  present  .1  n«-s^  lesign  technique  for  constructing  an  incrementally- 
built,  compiled,  producimn  s\stem  .A  TN  machine.  This  technique,  the  re¬ 
entrant  state  ntiwiule  let  hnujue.  makes  it  possible  to  have  a  highly  modular¬ 
ized,  compiled  .A  I  N  machine  that  allows  fle.\ible  search  control  by  having  a 
variable  entry  point  included  in  the  configuration  frame  of  every  state  module. 
We  have  shoun  this  solution  to  be  practically  fea.sible  by  implementing  an 
efficient  ATN  compiler  of  this  design. 


1.  INTRODUCTION 

The  properties  of  the  Augmented  Transition  Network  (ATN)  formalism  have  been  inten¬ 
sively  investigated  by  a  number  of  researchers  (Thorne,  Bratley  &  Dewar  (1968),  Bobrow  & 
Fraser  (1969),  Woods  (1970;  1973;  1978;  1980),  Kaplan  (1972;  1973a:  1973b;  1975),  Bates 
(1978)1.  The  expressive  power  of  the  ATN  formalism  is  at  least  as  great  as  that  of  transforma¬ 
tional  grammars.  The  recursive  nature  of  an  ATN’s  use  of  PUSH  arcs  to  find  non-terminal 
constituents  gives  it  the  p*)>er  of  handling  phrase-structure  rules,  while  its  register  handling 
capability,  including  the  effects  exhibited  by  VIR  arcs  coupled  with  the  HOLD  action,  gives  it 
the  power  of  handling  context-sensitive  transformational  rules.  For  this  reason,  ATNs  have 
been  widely  used  to  express  parsing  grammars  for  many  formal  or  natural  language  processing 

'  This  work  was  supported  in  part  by  the  Air  Force  Systems  Command.  Rome  Air  tievelopment  Center. 

(iriffiss  .Air  Force  Base,  New  York  13441-5700.  and  the  Air  Force  Office  of  Scientific  Research,  Bolling  .AFB  PC 
30.112  under  contract  Na  F30602-85-C-0008.  Bill  Rapaport’s  valuable  comments  on  earlier  drafts  of  this  pa 
per  w.Ls  gre.ntlv  .appreciated. 
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systems  (Woods,  Kaplan  &  Nash-Webber  (1972).  Woods  et  al.  (1976),  Waltz  (1976),  Bobrow  et 
ai.  (1977),  Wanner  Maratsos  (1978),  Kwasny  &  Sondheimer  (1981),  McKay  &  Martins 
( 1981 ),  Weischedel  &  Sondheimer  (1984),  Shapiro  (1982),  Shapiro  &  Kapaport  (1985)].  Shapiro 
(1975;  1982)  ha.s  shown  that  a  single  ATN  formalism  can  be  used  to  express  grammars  for 
both  sentence  recognition  and  sentence  generation;  thus,  ATNs  may  be  used  for  bidirectional 
symbol  transductions  between  surface  linguistic  forms  and  internal  semantic  representations. 

Since  hardware  ATN  machines  are  not  yet  available,  ATNs  are  processed  by  ATN  virtual 
machines  emulated  by  software.  Usually,  ATN  programs  (grammars)  are  processed  by  an  ATN 
interpreter,  another  program  usually  written  in  Lisp.  An  ATN  interpreter  is  a  general-purpose 
ATN  virtual  machine  in  the  sense  that  it  can  be  used  to  run  any  ATN  program,  However,  sys¬ 
tems  using  ATN  interpreters  tend  to  be  very  slow.  Another  way  of  building  an  ATN  virtual 
machine  is,  to  combine  a  given  ATN  program  with  the  emulating  program  into  one  homogene¬ 
ous  unit.  However,  such  an  ATN  machine  is  only  good  for  the  A  I'N  program  it  was  made  for. 
We  call  an  ATN  machine  of  this  kind  a  compiled  ATN  machine.  By  an  A7’A/  compiler,  we 
mean  a  special  device  (or  program)  that  builds  a  compiled  ATN  machine  when  a  particular 
A  I'N  program  is  given.  The  input/output  relations  for  the  various  (virtual)  machines  we  have 
defined  are  depicted  in  Figure  1. 

Figure  1.  Input/output  relations  of  various 
ATN-related  machines 

The  obvious  advantage  of  compiling  an  .ATN  is  faster  execution,  although  this  is  probably  at 
the  expense  of  a  larger  program  space. However,  in  most  computing  environments,  space  prob¬ 
lems  are  usually  efficiently  buffered  by  the  operating  system,  leaving  users  with  no  significant 
penalty  for  this  trade-off.  Therefore,  designing  a  good  ATN  compiler  is  an  interesting  issue  to 
be  addressed  in  computational  linguistics.  In  this  paper,  we  first  examine  'he  properties  of  an 
ATN  parser  as  an  abstract  machine  and  compare  two  different  models  for  .ATN  machines.  We 

■  Hinin  (1^83:37)  reports  thnt  he  obMmed  a  size  reduction  of  the  program  space  os  well  through  his  ATN 
c<^mpilation. 
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then  analyze  the  advantages  and  problems  associated  with  various  approaches  to  ATN  com¬ 
pilers  and  present  a  solution  to  a  problem  faced  by  what  is  arguably  the  best  approach. 
Finally,  we  provide  a  brief  description  of  an  ATN  interpreter  due  to  Kwasny  and  Shapiro 
(henceforth,  the  Kwasny-Shapiro  interpreter  that  we  believe  is  a  suitable  ATN  virtual 
machine  by  the  standards  discussed  here,  and  present  our  implementation  of  an  ATN  compiler 
based  on  that  interpreter.  However,  the  applicability  of  our  design  technique  for  the  compiler 
is  not  limited  to  the  Kwasny-Shapiro  interpreter. 


2.  TWO  MODF.1.S  OF  ATN  MACHINES 


When  an  A  TN  program  G  for  a  language  L  is  appropriately  loaded,  an  ATN  machine 
MG  tests  whether  an  input  S  is  a  sentence  of  L,  possibly  producing  an  analysis  of  5  as  a 
side-effect  if  5  is  a  sentence  of  L.  At  this  point,  it  is  not  our  immediate  concern  whether  MG 
is  an  actual  machine  or  a  virtual  machine  implemented  by  a  program  running  on  another 
machine.  Neither  does  it  concern  us  whether  G  is  explicitly  loaded  into  MG  or  is  a  part  of 
MG  itself.  We  view  an  ATN  program  as  an  expression  of  a  search  space  consisting 
exclusively  of  OR-trees.  Each  ATN  state  is  a  node,  with  the  initial  state  being  the  root  node. 
Each  arc  from  a  given  ATN  state  is  an  edge  from  the  node  for  the  state.-’  If  5  is  a  sentence  of 
L,  It  is  mapped  to  a  set  of  paths  from  the  root  to  a  terminal  in  the  search  space,  where  each 
such  path  represents  a  unique  analysis  of  5.  If  5  is  not  a  sentence  of  L,  it  fails  to  be  mapped 
to  any  path  starting  f rom  the  root  to  a  terminal.  On  this  view,  a  process  of  the  ATN 
machine’s  computation,  which  we  will  call  ATN  parsing,  is  essentially  an  OR -tree  search  for 
a  solution  path  or  a  set  of  solution  paths  in  the  search  space  defined  by  G.  where  the  search  is 


^  One  m.iy  alternativelv  view  the  search  space  made  by  an  ATN  program  as  a  directed  graph  in  which  a 
single  node  can  be  reached  through  more  than  one  path.  The  parsing  technique  of  keeping  a  substitution 
table  for  well  formed  substrings  appears  to  be  based  on  this  view  [Tennant  (1981:  67,  270)).  However, 
whether  a  string  analyzed  into  a  certain  syntactic  structure  will  have  an  invariant  semantic  interpretation 
regardles;  of  its  context  is  as  yet  an  open  question.  If  the  purpose  of  parsing  is  more  than  a  mere  acceptabilr 
ty  test  of  the  ^npux  form,  it  may  sometimes  turn  out  to  be  inadequate  to  associate  a  well  formed  substring 
with  the  results  recorded  in  the  table  due  to  a  previously  parsed  substring  which  was  found  in  a  different 
oath  environment.  We  will  not  pursue  this  issue  here,  since  it  is  not  directlv  relevant  and  our  model  is  nei 
ther  dependent  on  nor  incompatible  with  this  view. 


2B-101 


conditioned  by  the  input  S. 


The  central  component  of  an  ATN  machine  is  the  part  that  controls  the  search  in  this 
OR-tree  search  space.  There  are  two  different  models  for  managing  this  control  on  a  sequential 
machine;  the  host-recursion  model  and  the  production-system  modeL 

The  host-recursion  model  utilizes  the  host  machine’s  recursive  control  and  internal  stack. 
Here,  each  search  step  in  the  ATN  search  space,  such  as  a  transition  from  a  state  SI  to  another 
state  S2  via  an  arc  C,  is  realized  as  a  direct  call  to  the  program  module  corresponding  to  C 
from  the  program  context  for  SI  in  order  to  create  a  program  context  for  S2  embedded  one 
level  further  down  in  the  control  stack.  Since  an  ATN  state  transition  is  realized  as  a  recur¬ 
sive  call  at  the  host  machine  level,  the  configuration  frame^  of  each  ATN  state  can  be  main¬ 
tained  in  the  host  system’s  run-time  stack  of  activation  records.  For  this  reason,  the  ATN 
machine’s  backtracking  is  transparently’  managed  by  the  host  machine’s  own  recursive  control 
and  its  run-time  stack.  A  depth -first  search  strategy  can  easily  be  implemented  in  a  host- 
recursion  ATN  machine.  This  method  makes  the  data  flow  and  control  flow  of  an  ATN 
machine  very  simple  and  effortless.  But  its  drawback,  besides  the  potential  danger  of  the  host 
machine’s  stack  overflow,  is  that  the  control  of  the  ATN  search  is  tied  to  the  host  program¬ 
ming  language’s  data  and  control  structures.  Because  of  this,  it  becomes  relatively  more 
difficult  to  allow  the  ATN  user  to  exercise  control  over  the  order  of  search  during  parse  time. 
The  implementation  of  multiple  parsing  on  a  sequential  machine  also  turns  out  to  be  quite 
cumbersome  in  this  model  because  of  the  unnatural  forcing  of  backtracking  after  a  successful 
return  of  a  subroutine  call.^ 

In  a  production-system  model,  on  the  other  hand,  the  control  driver  of  an  ATN  machine 
is  an  indefinitely  iterated  execution  of  search  processes.  It  stops  only  when  an  interrupt  flag  is 
set  because  either  the  search  processes  have  been  exhausted  or  the  goal  of  the  computation  has 

By  this,  we  mean  the  ATN-level  dau  objects  relevant  to  a  sute  such  as  the  state  name,  the  present  input 
buffer,  the  path  history,  etc,  contrasted  to  the  data  objects  meamngrul  only  at  the  ATN  machine  emulation 
level. 

'  This  IS  usually  .ichieved  by  the  host  language’s  racility  for  a  forced  backtracking,  such  as  THROW  and 
CATCH  found  in  some  Lisps  [Christaller  (1983).  Laubsch  &  Barth  (1983)1. 
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been  achieved.  This  is  an  adaptation  of  the  standard  production  system  architecture  [Post 
(1943),  Davis  &  King  (l977;  30)1.  In  each  cycle  of  the  production,  a  process  is  taken  out  of  the 
process  queue  and  executed.  The  effect  of  a  process  being  executed  is  the  realization  of  a 
further  step  of  the  search  in  the  OR-tree  search  space,  possibly  resulting  in  the  creation  of  a 
number  of  new  successor  processes  appropriately  inserted  in  the  process  queue  according  to 
their  respective  search  priority.  Each  process  has  a  configuration  frame  that  specifies  the 
actions  to  be  uken  for  the  particular  search  step  and  the  data  on  which  those  actions  are  to  be 
applied.  Every  arc  of  each  ATN  state  is  realized  as  such  a  process,  and  an  ATN  state  is  realized 
as  a  group  of  processes  that  share  the  same  configuration  frame  except  for  the  actions  to  be 
taken.  An  ATN  machine  of  this  model  keeps  possession  of  the  top-level  control  of  the  ATN 
parsing  because  of  explicit  maintenance  of  the  process  queue  and  the  configuration  frames  for 
the  processes  in  the  que  ie.  Since  the  process  queue  is  maintained  at  parse  time,  this  model 
allows  the  order  of  the  search  to  be  controlled  flexibly  during  parse  time  by  a  priority  policy 
for  process  scheduling.  Thus,  the  order  of  the  search  is,  in  principle,  dynamically  determined 
at  the  time  when  an  ATN  program  runs,  rather  than  at  the  time  when  the  program  is  loaded 
into  the  ATN  machine. 

According  to  Winogiad  (1983:  261),  “the  scheduling  strategy  is  an  important  theoretical 
property  of  the  ATN  formalism.  Unlike  the  parsers  that  are  tied  to  a  specific  processing  order 
_  an  ATN  parser  can  make  use  of  a  grammar  that  was  written  without  a  specific  processing 
regime  in  mind.”  Thus,  an  ATN  machine  with  no  capability  for  managing  its  own  scheduling 
control  can  probably  be  considered  as  theoretically  incomplete,  despite  Finin’s  (1983)  defense  of 
his  depth-first  backtracking  implementation  of  an  ATN  machine.  Therefore,  even  if  a  particu¬ 
lar  implementation  does  not  elaborate  a  policy  of  scheduling  priority  for  ATN  parsing  and 
ends  up  with  an  extremely  trivial  algorithm  that  makes  it,  in  effect,  nothing  more  than  a 
depth-first  search,  we  believe  that  the  basic  architecture  of  an  ATN  machine  should  be  designed 
in  such  a  way  that  it  is  capable  of  incorporating  flexible  search  control  when  desired.  For  this, 
we  conclude  that  the  choice  between  these  two  ATN  models  is  more  a  practical  issue  than  a 
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theoretical  one,  and  we  believe  that  choosing  the  architecture  of  an  ATN  machine  with  a  pre¬ 
determined  search  control  by  choosing  the  host-recursion  model  is  comparable  to  choosing  a 
“quick  and  dirty”  solution.* 

The  advantage  of  the  production-system  model  over  the  host-recursion  model  docs  not 
stop  at  the  ATN  machine’s  capability  for  dynamic  scheduling  of  the  search  order  at  parse  time. 
Since  each  search  process  is  represented  in  the  machine  declaratively  rather  than  procedurally, 
it  is  also  very  simple  in  this  model  to  incorporate  a  debugging  facility  to  be  used  to  monitor 
the  behavior  of  the  parsing,  as  will  be  shown  in  our  later  discussion. 

3.  EVOLUTION  OF  A'I'N  COMPILER  CONSTRUCTION 

The  approach  to  ATN  compiler  construction  has  been  changing  from  the  earlier  integral 
compilation  approach  to  the  present  incremental  compilation  approach.  In  the  integral  compi¬ 
lation  approach  [Burton  (l976).  Burton  &  Woods  (1976)),  the  compiler  compiles  an  ATN  pro¬ 
gram  into  one  huge  f ragment  of  an  ATN  object  program,  which  is  characteristically  a  single 
loop  around  a  case  statement  (or  a  computed  GOTO-statement),  where,  in  essence,  each  ATN 
state  is  a  label  for  a  segment  of  code  that  implements  all  the  arcs  emanating  from  that  sute.^ 
On  the  other  hand,  the  incremental  compilation  approach  [Finin  (1977;  1983),  Christaller 
(1983),  Kochut  (1983),  Laubsch  &  Barth  (1983)]  attempts  a  maximum  modularization  of  the 
ATN  object  program  in  such  a  way  that  each  ATN  module  (such  as  an  ATN  state  or  arc)  is 
mapped  to  an  associated  program  module  in  the  compiled  code.  The  relation  between  these  two 
approaches  parallels  the  one  between  the  spaghetti-GOTO  style  of  programming  and  the  struc¬ 
tured  programming. 

The  obvious  advantages  of  incremental  compilation  over  integral  compilation  arc  the 
clarity  of  the  compiled  code  in  the  host  language  and  the  greater  mainuinability  of  the  com¬ 
piled  ATN  program.  Furthermore,  because  the  code  segments  implementing  each  ATN  state  or 

’  For  a  good  sketch  of  this  approach,  s<«  Laubsch  A  Barth  (1983:  152  162). 
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ire  are  deuched  from  the  code  segment  for  the  control  driver,  the  perspicuity  of  the  ATN 
notation  is  better  preserved  in  the  compiled  ATN  program.  Since  each  ATN  module  can  be 
compiled  and  loaded  independently,  the  ATN  programmer  can  simply  modify  a  selected  part 
of  his  source  ATN  program,  and  recompile  and  load  those  particular  modules,  leaving  the  rest 
intact.  Because  of  this  flexibility,  even  compilation  of  an  extremely  large  ATN  program  can  be 
done  piece  by  piece  on  a  small  system  where  the  total  available  space  may  be  restricted. 
Furthermore,  debugging  flags  or  a  monitor  can  very  easily  be  inserted  in  appropriate  object 
modules  corresponding  to  specific  source  modules  of  the  ATN,  enabling  the  user  to  monitor  or 
modify  the  ATN  program  in  debugging  mode  while  running  compiled  code. 

In  general,  however,  excessive  modularization  may  often  sacrifice  some  of  the  program’s 
execution  efficiency.  Depending  on  the  host  language,  an  invocation  of  a  module  may  some- 
tjmes  waste  its  resources  creating  an  activation  record  for  the  sake  of  a  formal  syntactic 
requiremenc  This  might  have  been  unnecessary  if  it  were  a  mere  jump  within  one  module. 
A  segment  of  a  program  is  justified  to  become  a  module  if  a  significant  amount  of  local  dau 
preparation  has  to  be  done  on  entering  its  e.xecution  context  due  to  the  logic  of  the  task.  In 
this  light,  we  argue  that  modularization  of  an  object  ATN  program  for  ATN  source  modules  is 
not  a  case  of  excessive  modularization.  Every  program  component  corresponding  to  an  ATN 
source  module  is  expected  to  have  to  set  up  its  own  data  environment  anyway.  In  fact,  every 
compiled  ATN  program  built  in  the  integral  approach  has  done  this.  Furthermore,  if  the  range 
of  control  transfer  is  widely  spread  over  a  number  of  mutually  exclusive  alternatives  in  a 
large  program  space,  and  the  operating  system  on  the  host  machine  performs  virtual  memory 
management  for  program  space,  the  efficiency  of  main  memory  management  can  be  greatly 
enhanced  by  a  well-modularized  program.  This  is  because  the  logical  segmentation  of  the  pro¬ 
gram  space  can  help  the  swapping  manager  deal  with  naturally  partitioned  memory  blocks. 
In  fact,  because  function  cal'io  are  auch  a  natural  form  of  program  control  in  some  srack 
oriented  languages  like  Lisp,  the  cost  of  module  invocation  should  not  deter  us  from  creating 
logically  well-motivated  modules. 
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Considering  these  factors., we  regard  the  development  from  integral  compilation  to  incre¬ 
mental  compilation  as  a  natural  consequence  of  the  evolution  of  general  programming  style. 
The  rest  of  our  discussion  will  be  limited  to  issues  and  problems  encountered  in  the  incremen¬ 
tal  compilation  approach. 

4.  CURRENT  ISSUES  IN  THE  INCREMENTAL  APPROACH 

A  number  of  designs  for  incremental  ATN  compilers  have  been  presented  [Finin  (1983), 
Christaller  (1983),  Laubsch  &  Barth  (1983),  Kochut  (1983)1.  All  of  them  clearly  address  the 
problems  of  integral  compilation  and  succeed  in  constructing  incremental  compilers.  However, 
they  all  share  the  same  pitfall  of  being  too  closely  tied  to  the  peculiarity  of  the  host  language, 
namely.  Lisp.  These  compiler  designs  are  all  host-recursion  models.  Each  ATN  state  is  mapped 
to  a  program  module  in  the  compiled  code,  but  each  transition  from  one  ATN  state  to  another, 
that  is,  each  step  of  the  search,  is  realized  as  a  call  of  one  Lisp  function  from  another 
Lisp  function.  ITie  basic  schema  of  a  compiled  ATN  state  module  is  an  OR  or  AND  construc¬ 
tion®  over  a  number  of  code  segments  intended  for  each  of  the  alternative  arcs.  The  conse¬ 
quent  limitations  are,  of  course,  all  those  exhibited  by  .^TN  machines  built  on  the  host- 
recursion  model.  One  typical  shortcoming  of  this  relinquishing  of  the  ATN’s  control  to  the 
host  language  is  exhibited  by  Laubsch  &  Barth  (l983).  Their  compiled  ATN  machine  does  not 
handle  HOLD  actions  and  alternative  lexical  interpretations,  and  docs  not  easily  provide  users 
with  flexible  debugging  information.  Although  Finin  (1983.'9)  argues  that  he  had  purposely 
"chosen  to  implement  a  depth-first  backtracking  version",  we  feel  that  there  was  hardly  any 
other  choice  left  once  the  ATN  machine  had  surrendered  its  control  to  the  host  language  (Lisp, 
in  his  case).  Due  to  the  direct  use  of  Lisp’s  own  run-time  suck  and  controls  (such  as  OR.  AND, 
THROW,  and  CATCH  )  to  guide  the  ATN  search,  users  were  deprived  of  the  freedom  of  tog¬ 
gling  the  ^TN  machine  between  single  parsing  and  multiple  parsing  modes.  Once  the  single 

*■  OR  or  AND  is  selecttd  drpcnding  on  whether  one  or  all  possible  parsings  o(  the  input  sentence  are 
sought.  In  .in  actual  implement.ition.  AND  construction  can  be  repl.iced  bv  a  PROG  structure. 
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parsing  policy  is  chosen  as  the  primary  option,  allowing  multiple  parsing  turned  out  to  be  a 
problem  because  a  faked  failure  has  to  be  created  even  after  a  genuine  success  of  a  series  of 
embedded  function  calls  corresponding  to  an  acceptable  ATN  search. 

Although  host-recursion  implementations  take  an  incremental  compilation  approach,  in 
principle,  there  has  been  little  interest  shown  in  breaking  down  a  state  module  further  into 
submodules  corresponding  to  each  arc  of  the  state.  We  will  call  this  arc-module  modulariza¬ 
tion.  We  realize  that  such  implementations  can  achieve  a  superficial  arc-module  modulariza¬ 
tion  merely  by  making  a  slight  modification  to  the  compiler  so  that  it  would  decompose  the 
code  for  every  ATTJ  sute  into  separate  submodules  for  its  arcs.  However,  we  suspect  that  they 
are  right  not  to  do  so  in  the  absence  of  a  compelling  motivation,  since  it  would  mean  even  fas¬ 
ter  consumption  of  Lisp’s  run-time  stack.  In  their  design,  control  over  the  alternative  arcs  of 
an  ATN  sute  is  determined  by  the  compiler  once  and  for  all,  and  then  is  f rozen  inflexibly  in 
the  program  body  of  the  emulating  host  system.  When  no  control  flexibility  of  invocation  is 
preserved,  arc-module  modularization  for  the  sake  of  mere  code  decomposition  turns  out  to  be  a 
question  about  programming  style  rather  than  an  issue  of  theoretical  significance. 

Earlier,  we  pointed  out  that  the  production-system  model  is  a  better  solution  for  an  ATN 
machine  that  preserves  its  own  search  control.  On  this  design,  the  ATN  machine  explicitly 
mainuins  its  own  appropriately  sorted  process  queue.  To  our  knowledge,  there  have  been  no 
discussions  of  the  design  issues  of  compiled  ATN  machines  of  the  production-system  model 
viewed  from  the  incremenul  compilation  approach.  At  first  thought,  it  docs  not  appears  to  be 
too  difficult  to  construct  a  production-system-like  driver  that  would  work  on  a  queue  of 
processes  consisting  of  ATN  sute  modules  that  would  be  pretty  much  like  the  sute  modules  in 
the  host-recursion  model.  But  if  we  decide  to  map  an  ATN  sute,  rather  than  an  arc,  to  the 
smallest  ATN  module  in  the  compiled  ATN  machine,  we  are  faced  with  a  serious  problem. 
Granting  that  a  compiled  program  cannot  be  modified  during  run-time,  it  looks  as  if  we  have 
no  other -choice  but  to  freeze  the  ATN’s  search  order  within  a  sute  at  the  time  of  ATN  pro¬ 
gram  compilation.  Since  every  arc  emanating  from  each  ATN  sute  is  mapped  to  an  ATN 
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later  re-enirance  for  the  remaining  alternative  arcs.  The  problem  is  reduced  to  that  of  how  a 
state  module  can  be  made  re-entrant. 

5.  THE  RE-ENTRANT  STATE  MODULE  TECHNIQUE 

In  the  Kwasny-Shapiro  interpreter  based  on  the  production  system  model,  each  process  in 
the  queue  is  made  of  a  configuration  frame  that  holds  the  source  definition  of  an  ATN  state^ 
being  active  in  the  search  frontier,  as  well  as  other  information  relevant  to  a  process.  Every 
time  a  process  is  fired,  one  and  only  one  of  its  arc  alternatives  is  consumed,  and  the  alternative 
arc  list  of  the  process  is  appropriately  reduced.  When  a  process  is  on  the  queue,  it  corresponds 
to  a  state  module  representing  all  the  alternative  arcs.  However,  when  it  is  fired,  a  process  is 
realized  as  an  arc  module  that  activates  a  specific  search  process  associated  with  a  particular  arc 
instance.  We  call  processes  on  the  queue,  each  mapped  to  an  ATN  state  active  on  the  search 
frontier,  archiprocesses^^  to  distinguish  them  from  actually  expanded,  specific  individual 
search  processes.  When  an  archiprocess  is  fired,  it  chooses  one  of  its  successor  processors  in 
accordance  with  a  priority  policy  to  create  a  successor  archiprocess,  which  is  then  also  inserted 
into  the  queue.  The  successor  archiprocess  inherits  its  configuration  frame,  appropriately 
modified  by  its  own  search  actions,  from  the  parent  process.  When  such  a  child  process  is 
created,  the  parent  archiprocess  also  modifies  its  own  configuration  frame  so  that  it  no  longer 
contains  that  consumed  process  in  its  alternative  list,  or  comj-'etely  removes  itself  from  the 
queue  if  no  alternatives  are  left. 

The  solution  to  the  problem  of  devising  a  re-entrant  state  module  lies  in  the  implicit 
notion  of  archiprocesses  found  in  the  Kwasny-Shapiro  interpreter.  The  process  queue  of  the 
compiled  ATN  machine  can  also  queue  a  sequence  of  archiprocesses,  but  with  the  configuration 
frames  slightly  restructured.  In  a  compiled  ATN  machine,  the  configuration  frame  of  an 

''  A  source  definition  of  a  state  is  a  set  of  source  definitions  of  alternative  arc  instances. 

This  term  is  borrowed  from,  and  is  analogous  to,  the  notion  of  “archiphonemes"  or  “archi-unit*  used  in 
phonology,  as  explained  in  Chomsky  A  Halle  (1968:64). 
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search  process,  a  compiled  production-system  ATN  machine  is  impossible  even  to  imagine 
without,  at  least,  an  implicit  notion  of  arc-module  modularization.  Without  resolving  this 
arc-module  modularization  problem,  it  is  impossible  to  make  the  system  have  the  capability  of 
incorporating  flexible  search  control.  One  might  propose  splitting  state  modules  such  as  the 
above  into  stand-alone  arc  modules  for  every  arc  in  the  state,  and  queue  them  in  the  process 
queue  as  if  they  were  totally  unrelated  to  their  sibling  processes.  However,  all  the  sibling  arcs 
from  one  ATN  state  are,  in  fact,  closely  related  to  one  another  by  the  facts  that  they  inherit  a 
common  configuration  frame  and  that  they  are  competing  alternatives  at  a  same  point  of  the 
search  context.  Because  of  this,  the  compiled  production-system  ATN  machine  should  preserve 
the  ATN  states’  statehood  property,  as  well.  We  therefore  reject  the  approach  that  would 
eliminate  state  modules  in  the  compiled  machine. 

We  conclude  that,  in  order  to  construct  a  compiled  ATN  machine  which  will  meet  the 
requirements  of  capturing  the  ATN  states’  statehood  and  preserve  control  flexibility  over  alter¬ 
native  arcs,  we  will  need  to  map  both  ATN  states  and  arcs  to  modules  of  the  compiled 

production-system  ATN  machine.  The  consequent  problem  is,  then,  how  we  can  arrange  it  so 

I 

that  each  process  in  the  queue  will  corresponding  simultaneously  with  a  state  itself  and  with 
only  one  of  its  arc  instances.  This  is  a  problem  for  the  host-recursion  model,  as  well;  however, 
it  was  not  realized  to  be  a  problem,  since  the  host  system’s  control  automatically  stores  a  state 
module  on  the  run-time  stack  and  invokes  each  arc  alternative  one  by  one  using  the  host 
system’s  backtracking  mechanism.  This  is  done  at  the  expense  of  the  ATN’s  being  limited  to 
the  blind  depth-first  search  policy  (cf.  Section  7.3.1).  We  find  the  notion  of  program  re¬ 
entrance  hidden  in  this  mechanism  of  host  system’s  stack  manipulation  and  automatic  back¬ 
tracking  control.  A  state  module  remains  in  the  stack  waiting  for  re-entry  to  each  of  its  arc 
in-stance.  In  the  production-system  model,  an  ATN’s  own  process  queue  is  used  instead  of  the 
host  system’s  stack.  Thus,  the  solution  to  this  problem  for  a  production-system  ATN  machine 
seems  to  be  to  make  each  process  in  the  queue  be  a  state  module,  where  a  state  module  is  not 
only  consumed  and  exited  from  with  one  arc  tried,  but  also  left  in  the  queue  available  for  a 


2B-109 


later  re-entrance  for  the  remaining  alternative  arcs.  The  problem  is  reduced  to  that  of  how  a 
state  module  can  be  made  re-entrant. 

5.  THE  RE-ENTRANT  STATE  MODULE  TECHNIQUE 

In  the  Kwasny-Shapiro  interpreter  based  on  the  production  system  model,  each  process  in 
the  queue  is  made  of  a  configuration  frame  that  holds  the  source  definition  of  an  ATN  state^ 
being  active  in  the  search  frontier,  as  well  as  other  information  relevant  to  a  process.  Every 
time  a  process  is  fired,  one  and  only  one  of  its  arc  alternatives  is  consumed,  and  the  alternative 
arc  list  of  the  process  is  appropriately  reduced.  When  a  process  is  on  the  queue,  it  corresponds 
to  a  sute  module  representing  all  the  alternative  arcs.  However,  when  it  is  fired,  a  process  is 
realized  as  an  arc  module  that  activates  a  specific  search  process  associated  with  a  particular  arc 
instance.  We  call  processes  on  the  queue,  each  mapped  to  an  ATN  state  active  on  the  search 
frontier,  archiprocesses^^  to  distinguish  them  from  actually  expanded,  specific  individual 
search  processes.  When  an  archiprocess  is  fired,  it  chooses  one  of  its  successor  processors  in 
accordance  with  a  priority  policy  to  create  a  successor  archiprocess,  which  is  then  also  inserted 
into  the  queue.  The  successor  archiprocess  inherits  its  configuration  frame,  appropriately 
modified  by  its  own  search  actions,  from  the  parent  process.  When  such  a  child  process  is 
created,  the  parent  archiprocess  also  modifies  its  own  configuration  frame  so  that  it  no  longer 
contains  that  consumed  process  in  its  alternative  list,  or  completely  removes  itself  from  the 
queue  if  no  alternatives  are  left. 

The  solution  to  the  problem  of  devising  a  re-entrant  state  module  lies  m  the  implicit 
notion  of  archiprocesses  found  in  the  Kwasny-Shapiro  interpreter.  The  process  queue  of  the 
compiled  ATN  machine  can  also  queue  a  sequence  of  archiprocesses,  but  with  the  configuration 
frames  slightly  restructured.  In  a  compiled  ATN  machine,  the  configuration  frame  of  an 

“  A  source.deftnition  of  a  state  is  a  set  of  source  definitions  of  altern.itiv;  arc  instances. 

This  term  is  borrowed  from,  and  is  analogous  to,  the  notion  of  “archiphonemes"  or  “archi-unit"  used  m 
phonology,  as  explained  in  Chomsky  A  Halle  (1%8*4). 
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archiprocess  in  the  queue  holds  a  pointer  to  an  appropriate  compiled  state  module,  instead  of 
remembering  the  source  definition  of  the  alternative  arcs.  A  state  module  is  an  unalterable 
program  module  that  contains  in  it  as  many  distinct  entry-points  as  the  number  of  arcs 
emanating  from  the  state,  making  it  resemble  an  ignition  distributor  of  an  automobile.  Each 
ATN  arc  from  the  state  is  also  mapped  to  a  unique  arc  module  dependent  on  the  state  in  order 
to  achieve  a  higher  degree  of  modularization  along  the  lines  of  the  incremenul  compilation 
approach.  Entering  a  state  module  with  a  specific  entry-point  value  “ignite"  a  specific  arc 
module.  As  Figure  2  illustrates,  each  entrance  to  a  state  module  results  in  the  invocation  of 
one  and  only  one  arc  module,  and  the  state  module,  being  left  in  the  queue,  remains  re-entrant. 

Figure  2.  Block-diagram  of  the  control  flow  through 
a  re-entrant  state  module  to  an  arc  module 

***"*»***««•  ABOUT  HERE  **************** 

We  claim  that  this  new  design  technique,  the  rt-entranl  state  module  technique,  can  be  used  to 
incrementally  build  a  compiled  production-system  ATN  machine  that  allows  flexible  search 
control  at  parse  time.  In  this  design,  as  in  the  Kwasny-Shapiro  interpreter,  every  process  in 
the  queue  is  statically  an  archiprocess  associated  with  an  ATN  state  currently  found  in  the 
search  frontier,  and  a  firing  of  such  a  process  is  dynamically  equivalent  to  an  execution  of  one 
and  only  one  alternative  arc  of  the  associated  state.  The  priority  policy  used  to  decide  which 
arc  is  to  be  selected  (which  is  comparable  to  the  cylinder  firing  order  in  our  automobile  anal¬ 
ogy;  only  depends  on  the  algorithm"  that  dynamically  computes  the  value  of  the  entry-point 
for  each  entrance  to  the  state  module. 

6.  DEscn^ipnoN  OF  THE  kwasny-shapiro'atn  interpreter 


In  this  section,  we  describe  the  organization  of  the  Kwasny-Shapiro  ATN  interpreter", 

"  The  investigation  and  el.iboration  of  this  algorithm  is  certainly  an  issue  related  to  ATN  parsing,  but  ir 
relevant  to  our  discussion  of  the  design  issue.  However,  it  is  obvious,  at  least,  that  the  algorithm  must  not 
generate  one-fntry  value  more  than  once. 

'■  This  was  initially  designed  by  Stan  Kwasny  and  Stuan  C  Shapiro  at  Indiana  University  in  19''e  to 
the  specifications  presented  by  Woods’s  (1970;  1973)  model  [Kwasny  (1974)J  and  has  been  expanded  by  the 
SNePS  Research  Group  at  SUNY  Buffala  with  a  mapr  contribution  by  Gerard  Donlon.  It  presently  includes. 
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an  ATN  virtual  machine  currently  implemented  in  Franz  LISP  and  i  jr»ning,  normally  in  the 
S.'.ePS’^  environment  on  VAX  750s  and  VAX  780s  under  the  Berkeley  Unix  4.3  operating  sys¬ 
tem.  There  are  three  reasons  for  describing  this  interpreter  before  the  ATN  compiler  and  the 
compiled  ATN  machine:  ^i)  it  -was  designed  according  to  the  production-system  model  that  we 
have  been  defending,  (ii)  the  re-entrant  state  module  designing  technique  was  based  on  it,  and 
(iii)  our  compiler  implementation  was  directed  toward  building  a  compiled  ATN  machine 
functionally  equivalent  to  it. 

6.1  ATN  formalism 

The  arcs  presently  supported  by  the  Kwasny-Shapiro  ATN  machine  are:  CAT,  WRD, 
TST,  TO,  JUMP,  PUSH,  (rAI.L.  RCALL,  POP,  VIR,  and  GROUP.  The  syntax  and  semantics  of 
the  ATn  formalism  used  by  this  interpreter  is  described  in  Shapiro  (1982),  except  for  the  fol¬ 
lowing  modifications.  JUMP  arcs  are  allowed  to  succeed  with  an  empty  input  buffer,  because  a 
JUMP  arc  consumes  no  input;  this  relaxation  enables  an  ATN  to  capture  and  express  some  regu¬ 
larities  that  can  best  be  handled  at  the  end  of  a  sentence.  The  test  ENEXTFSENTENCE  and  the 
form  LEX  are  defined  and  given  the  same  semantics  as  described  in  Bates  (1978).  Besides  the 
CALL  arc  proposed  by  Shapiro  (1982),  another  new  ATN  arc,  RCALL,  has  been  added;  it  has 
the  same  syntax  as  a  CALL  arc  (cf.  Shapiro,  1982:14),  namely: 

(RCALL  <state>  <form>  <test>  <preaction  or  action>* 

<register>  <action>*  <terminal  action>  ) 

The  semantics  of  an  RCALL  arc  differs  from  that  of  a  CALL  arc  only  in  that  RCALL  locally 
replaces  its  whole  input  buffer  with  <form>,  while  CALL  globally  prepends  <form>  to  the 
current  input  buffer.  The  new  RCALL  arcs  were  found  to  be  convenient  in  writing  genera¬ 
tion  grammars  in  which  one  often  wants  to  delimit  the  scope  of  the  local  input  buffer  for  a 

hesidts  the  ATN  parser  core,  an  English  morphological  analyzer  and  synthesizer  written  by  Darrel  Joy  at  In 
diana.  and  an  input  interface  written  by  Hanyong  Vuhan  at  SUNT  Buffalo  that  .accepts  sentences  written  in 
normal  English  orthographic  conventions  and  takes  care  of  punctuation  marks  and  capitalized  sentence-initial 
words. 

'  *  SNePS  IS  a  semantic  network  processing  system  that  can  be  used  to  represent,  store,  manipulate 
knowledge  (Shapiro  (1979),  Shapiro  A.  Rapaport  (1985)). 
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CALL  to  one  and  only  one  specific  semantic  network  structure.  GROUP  arcs  have  been  imple¬ 
mented  so  as  to  have  the  same  meaning  as  described  in  Christaller  (1983:75)  and  Winograd 
(1983:265).  A  GROUP  arc  groups  a  number  of  arcs  into  a  set  of  mutually  exclusive  alterna¬ 
tives  such  that  if  the  test  of  any  one  member  of  the  set  succeeds,  the  rest  are  not  even  tried.  A 
GROUP  arc  is  distinguished  from  other  kinds  of  arcs  in  that  it  is  a  meu-arc  [Christaller 
(1983:129)]  intended  to  specify  the  control  of  search  on  a  sequential  machine  rather  than  to 
specify  the  substance  of  a  search. 

6.2  Organization  and  control  structures 

The  Kwasny-Shapiro  ATN  interpreter  consists  of  the  following  Lisp-implemented  com¬ 
ponents:  I/O  handler,  program/data  storage,  control  driver,  and  ATN-operation  modules.  The 
four  main  subcomponents  of  the  I/O  handler  are  the  input  sentence  reader,  the  ATN  program 
loader,  the  lexicon  data  loader,  and  the  message  printer.  The  message  printer,  called  TALK, 
prints  out  trace  information  in  varying  levels  of  details  selected  by  the  ATN  user.  The  input 
sentence  reader,  PARSERREAD,  reads  input  sentences  typed  in  normal  English  orthographic 
convention.  The  ATN  program  loader,  ATNIN,  and  the  lexicon  data  loader,  LEXIN,  load  the 
AT.N  program  and  the  user's  lexicon  into  appropriate  storage  areas;  they  are  implemented  using 
the  host  Lisp’s  property  lists.’'*  The  ATN  operation  modules  component  consists  of  Lisp  func¬ 
tions  that  implement  each  of  the  ATN  operations,  including  all  the  ATN  arcs  (such  as  CAT, 
WRD,  PUSH,  etc.),  ATN  tests  and  actions  (such  as  NULLR,  SOTR,  HOLD,  TO,  JU.MP,  etc.),  and 
all  ATN  forms  (such  as  *,  LEX,  GETF,  etc.).  Each  of  these  Lisp  functions  tests,  modifies,  or 
retrieves  appropriate  data  elements  of  the  configuration  frame  of  the  current  process  or  one  of 
the  ancestral  or  descendant  processes  to  satisfy  the  semantics  of  the  associated  ATN  operation. 
The  control  driver,  consisting  of  the  Lisp  functions  PARSE,  PARSE2,  and  STEPPARSER,  is  the 
component  that  controls  the  parsing  computation  in  a  production-system-like  fashion. 

'■*  ATNIN  and  LEXIN  store  the  ATN  programmer’s  or  the  user's  definition  of  e.ich  ATN  sute  .ind  lexic.il 
entry  ai  the  property  value  of  Lisp  atom  of  the  entry  name  under  the  property  indicators,  '^arcs  and  '!=dict. 
respectively.  These  storage  areas,  which  are  (normally)  not  modified  during  p.irse  time,  function  as  the  read 
onlv  memory  of  the  ATN  machine. 


2B-113 


PARSE  initializes  the  ATN  machine,  reads  an  input  sentence,  and  then  invokes  PARSE2’^ 
to  parse  the  input  form.  PARSE2  interacts  with  STEPPARSER  to  drive  the  parsing  processes. 
Each  process  is  comparable  to  an  ATN  state  with  all  its  ATN  arcs  not  yet  attempted  saved  in 
the  list  of  alternatives,  and  those  ATN  arcs  already  attempted  having  been  removed.  As  a  dau 
type,  each  process  is  a  configuration  frame  made  of  a  record  of  10  fields,  which  contains  in  it 
all  the  information  necessary  for  the  execution  or  trace  of  the  process,  such  as  the  ATN  state 
name,  definitions  of  the  arcs  yet  to  be  tried,  the  table  of  ATN  registers  and  their  values,  the 
input  form  in  the  current  buffer,  the  path  information  of  the  current  search  process,  and  so  on. 
Initially,  PARSE2  designates  a  process  made  for  the  suiting  ATN  sute  as  the  current  process 
and  calls  STEPPARSER.  Upon  being  called  by  PARSE2  for  the  current  process,  STEPPARSER 
iteratively  tries  the  remaining  arcs  until  either  it  finds  the  first  one  that  leads  to  a  successful 
ATN  sute  transition  or  no  alternative  arcs  are  left.’^  Each  trial  of  such  an  arc  alternative  is 
realized  as  an  invocation  of  an  appropriate  arc  module’’  (such  as  CAT,  PUSH,  etc.)  with  the 
source  ATN  arc  definition  as  its  argument  and  the  dau  in  the  configuration  frame  as  its 
environment.  An  arc  module  returns  either  a  failure  report  or  the  successor  process’®.  The 
dau  of  the  successor  process  for  alternative  arcs  is  newly  made  with  the  information  copied 
from  the  original  definition  of  the  urget  ATN  sute  stored  in  the  read-only  ATN  program 
storage  area.  Eventually.  STEPPARSER  returns  to  PARSE2  a  queue  of  processes  (possibly 
empty)  that  is  the  result  of  appending  the  processes  for  the  remaining  alternative  arcs  to  the 
successor  processes  returned  by  the  first  successful  arc  module  call.  If  a  non-empty  successor 
process  queue  is  returned  from  STEPPARSER,  PARSE2  designates  the  first  process  in  the  queue 
as  the  current  process  and  pushes  the  remaining  processes  onto  the  suck  of  alternative 
processes.  An  empty  successor  process  queue  returned  to  PARSE2  from  STEPPARSER  indicates 

PARSE2  Ciui  aho  be  directly  invoked  from  SNePS  with  a  default  initioluation  via  a  SNePS  User 
Language  (SNePSUL)  command  for  ATN  parser  invocation  from  within  SNePS.  In  this  ease,  SNePS  communi- 
cates  with  the  ATN  parser  by  means  of  the  SNePS  structures  left  as  side-effects  of  parsing. 

The  order  of  making  the  choice  among  the  .iltern.atives  does  not  have  to  be  tied  to  the  order  in  which 
the  arcs  are  listed  in  the  ATN  program,  although  this  interpreter  adheres  to  that  order  for  want  of  another 
dependable  policy. 

”  Note  that  the  interpreter's  arc  modules  are  generic  (cf.  Section  7J.4). 

'•  Since  some  arcs  such  as  CAT  may  be  succeeded  by  more  than  one  process,  it  is  actually  a  set  of  successor 
processes. 
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that  the  previously  designated  current  process  has  been  completely  blocked  and  that  the  sys¬ 
tem  should  backtrack  one  step.  In  this  ca.se,  PARSE2  undoes,  if  necessary,  all  the  side-effects 
created  by  the  blocked  process,  pops  a  process  from  the  stack  of  alternative  processes,  and  desig¬ 
nates  it  as  the  current  process.  PARSE2  iterates  this  cycle,  invoking  STEPPARSER  until  either 
the  accumulated  results  of  the  parse  warrant  the  termination  of  the  parsing  or,  upon  receiving 
STEPPARSER’s  failure  repxirt.  it  finds  that  the  stack  of  alternative  processes  is  depleted. 

6.3  Handling  of  the  ATN’s  own  recursion  and  side-effects 

The  above  description  shows  that  the  Kwasny-Shapiro  ATN  interpreter  was  designed 
according  to  the  production-system  model.  It  maintains  a  queue  of  processes  and,  for  each  pro¬ 
duction  system  cycle,  selects  and  activates  a  process  out  of  the  queue  production-system  model, 
not  relying  on  the  host  system’s  recursive  stack.  In  this  section,  we  examine  how  delayed 
ATN  actions  or  side-effects  are  handled. 

PUSH-family  arcs  (namely,  PUSH,  CALL,  and  RCALL),  which  push  the  ATN’s  environ¬ 
ment  one  level  deejxr  in  the  ATN’s  own  recursion  stack,  are  hybrid:  they  contain  in  their  ATN 
definition  two  compxsnents,  which  are  executed  on  two  non-adjacerit  occasions.  One  part  is  exe¬ 
cuted  when  the  process  for  the  arc  itself  is  in  control;  the  other  paart,  which  we  call  the  list  of 
push-pop  actions,  remains  dormant  until  the  process  for  a  matching  pxap  acquires  control  and 
awakens  it  to  be  executed.  When  the  process  for  a  member  of  the  PUSH-family  is  executed,  it 
records  the  push-pwp  actions,  that  is,  the  list  of  <action>s  and  <terminal  action >,  into  a  spaecial 
suck  reserved  in  the  configuration  frame  for  this  particular  purpxae,  and  lets  it  be  inherited  by 
all  its  successor  processes.  One  of  the  main  tasks  of  POP  arc  modules  is  to  f»p  the  top  of  this 
suck  and  execute  these  push-pop  actions. 

In  designing  an  ATN  machine,  the  designer  may  sometimes  find  that  some  dau  operations 
cannot  be  confined  in  one  process’s  configuration  frame  at  an  accepuble  cost.  We  say  that  a 
dau  manipulation  is  done  as  a  side-effect  if  the  effect  of  the  data  operation  is  not  confined 
within  a  process’s  configuration  frame.  On  a  system  that  adopts  the  flexible  search  strategy, 
determining  which  side-effects  are  to  be  undone  and  which  are  to  be  kept  for  each  process  is  an 
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extremely  complicated  matter.  We  have  noted  that  the  Kwasny-Shapiro  ATN  interpreter 
mostly  runs  in  the  SNePS  environment  in  which  it  is  allowed  to  modify  SNePS  network 
structures  as  a  side-effect  in  the  middle  of  the  parsing  operation.  In  general,  SNePS  network 
structures  are  global.  In  this  ATN  machine,  one  of  the  ten  cells  of  the  configuration  frame, 
dedicated  to  remembering  all  side-effects,  records  all  the  SNePS  nodes  created  by  the  process. 
When  backtracking  takes  place,  PARSE2,  by  accessing  this  cell,  can  thus  deletes  all  the  SNePS 
structures  created  by  the  failed  process. 

7.  implementation 

In  this  section,  we  present  a  compiler  implemented  to  test  the  state-module  re-entrant 
technique  we  have  proposed,  showing  some  details  of  the  compiled  ATN  machine  generated  by 
the  compiler.  This  compiler  was  designed  to  produce  a  compiled  ATN  machine  that  would  be 
functionally  equivalent  to  the  Kwasny-Shapiro  interpreter  ATN  machine,  following  the  same 
ATN  formalism  and  running  in  precisely  the  same  software  and  hardware  environments.  For 
this  reason,  quite  a  number  of  program  modules  are,  in  fact,  shared  by  both  ATN  machines. 
We  will  first  describe  the  ATN  compiler  in  order  to  see  how  it  generates  a  compiled  ATN 
machine  for  a  given  ATN  program,  and  then  we  will  examine  the  details  of  the  compiled  ATN 
machine  as  an  abstract  object. 

7.1  Description  of  the  compiler 

TTie  ATN  compiler  is  a  program  package  written  in  Franz  LISP,  running  in  the  same 
software  and  hardware  environments  as  the  Kwasny-Shapiro  ATN  interpreter.  Given  a  file 
containing  an  ATN  program,  it  generates  a  Lisp  program  which  is  a  compiled  ATN  machine. 
The  initial  output  of  the  ATN  compilation  is  a  file  of  Lisp  source  code  which  may  be  further 
compiled  into  a  lower-level  language  code  using  the  general  Lisp  compiler  provided  in  the 
operating  system  environmenL  As  a  data  type,  an  ATN  program  is  a  set  of  ATN  state 
definitions;  To  compile  an  ATN  program,  the  cycle  of  reading  a  state  definition  and  then  cal¬ 
ling  COMPSTATE,  the  specialist  for  the  state  compilation,  is  iterated  until  the  source  ATN 
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program  file  has  been  processed.  For  each  ATN  sure,  COMPSTATE  generates  a  segment  of  Lisp 
code  that  defines  an  associated  state  module  conuining  in  it  as  many  entry  points  as  the 
number  of  arcs  emanating  from  the  state.  Then,  for  each  arc  in  the  sutc,  COMPSTATE  calls 
COMPARC,  which  in  turn  calls  an  appropriate  arc  compilation  specialist.  Figure  3  shows  the 
Lisp  code  for  COMPARC 

Figure  3.  Lisp  definition  of  CO.MPARC  function 
structures  for  the  compiled  ATN  machine. 

*«**»***»****»«  ABOUT  HERE  »»»**»»»«»*»«»» 

The  arc  compilation  specialists  are  named  COMP<arcname>  (such  as  COMPCAT,  COMPPUSH, 
COMPJUMP,  etc.),  where  <arcname>  is  the  name  of  the  arc  it  is  specialized  for.  Each  one  pro¬ 
duces  the  code  of  each  state-dependent  arc  module.  The  function  COMPGROUP,  the  specialist 
for  the  GROUP  arc  compilation,  generates  the  code  for  a  pseudo-state  (cf.  Section  7.3.4)  and 
recursively  calls  appropriate  arc  specialists  to  generate  the  codes  for  the  embedded  arc  modules, 
allowing  arbitrarily  deeply  embedded  GROUP  arcs. 

The  task  of  an  arc  compilation  specialist  is  to  re-write  the  associated  interpreter’s  generic 
arc  function  (such  as:  CAT,  PUSH,  etc.)  into  a  state-specific  arc  module  completing  as  much 
compile  time  evaluation  and  code  expansion  as  possible  in  the  context  determined  by  the  source 
slate  and  the  arc  instance.  A  typical  example  of  code  expansion  is  unfolding  of  loops.  Itera¬ 
tions  in  the  generic  arc  function  which  are  designed  to  interpret  arbitrarily  many  consecutive 
ATN  source  actions  are  opened  and  unfolded  into  a  finite  sequence  of  the  actual  actions  found 
in  the  given  arc  instance.  Whenever  the  specialist  can  obtain  a  compile  time  evaluation  of  a 
form,  it  takes  one  of  the  two  actions:  (i)  if  the  form  is  the  condition  part  of  a  condition -action 
construction,  then,  depending  on  the  obtained  value,  it  either  directly  proceeds  to  generate  the 
code  for  the  action  part,  or  totally  ignores  the  condition-action  construction;  (ii)  if  the  form  is 
not  a  condition  part,  then  it  generates  the  code  replacing  the  form  with  the  bound  value  with 
a  quote  in'frdnt  so  that  it  won’t  be  evaluated  again  by  the  compiled  machine  later.  For  those 
forms  that  cannot  be  evaluated  at  compile  lime,  the  specialist  generates  the  code  largely  by 
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duplicating  the  corresponding  segment  of  the  code  of  the  interpreter’s  generic  arc  module. 
Sometimes,  the  arc  compilation  specialist  may  attempt  a  rough  evaluation  to  simplify  the  com¬ 
piled  code  if  it  should  succeed.  By  rough  evaluation,  we  mean  an  evaluation  that  may  succeed 
at  compile  time  in  some  special  case.  An  example  is  the  examination  of  the  test  part  of  ATN 
arcs,  which  often  turns  out  to  be  a  Boolean  constant  T.  The  function  ROUGHTEST-T  is  used 
to  avoid  generating  code  for  a  parse-time  call  to  the  function  TEST  if  the  test  is  simply  the 
constant  T  in  the  ATN  source  which  even  the  compiler  can  tell  the  value  of.  However,  we 
have  not  yet  included  any  extensive  code  optimization  routines  or  error  checking  measures. 

7.2  The  configuration  frame  of  the  ATN  machine 

The  data  structures  of  the  compiled  ATN  machine  are  based  on  the  interpreter  machine’s 
data  structures.  Both  machines  maintain  a  queue  of  processes.  As  a  data  type,  each  process  is  a 
configuration  frame,  which  is  a  record  of  10  information  fields.  In  the  interpreter,  the 
configuration  frame  holds  the  following  information: 

<state>,  the  name  of  the  current  ATN  state: 

<arcset>,  the  definitions  of  the  remaining  alternative  arcs; 

<bufFer>,  the  input  form  not  yet  consumed; 

<regset>,  the  current  ATN  registers  and  their  values; 

<hstack>,  the  stack  of  hold  registers  and  their  values; 

< level  >,  the  level  of  the  ATN  recursion: 

<pstack>,  the  stack  of  the  push-pop  action  lists; 

<path>,  the  information  on  the  path  history; 

< nodeset  >,  the  SNePS  nodes  created  as  the  process’s  side-effect; 

<wcight>,  information  for  the  search  priority. 

The  structure  of  a  configuration  frame  used  in  the  compiled  machine  is  the  same  as  the 
interpreter’s,  except  that  <arcset>  is  here  replaced  by  <entry>  for  the  value  of  the  entry 
point.  Another  minor  difference  is  that,  in  the  compiled  machine,  push-pop  actions  are  no 
longer  a  list  of  source  ATN  actions.  When  a  PUSH-family  process  is  executed,  the  sequence  of 
the  actions  in  the  list  is  defined  into  an  equivalent  lambda  function  with  a  unique  name 
assigned,  and  the  name,  instead  of  the  action  list,  is  stored  in  <pstack>. 

7.3  Architecture  of  the  compiled  ATN  machine 
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The  compiled  ATN  machine  is  made  of  two  components;  the  core  component,  and  the 
AT N  module  component.  The  ATN  module  component,  which  originates  in  the  user’s  ATN 
program,  consists  of  all  the  compiled  ATN  modules,  each  of  which  is  either  a  mapping  of  an 
ATTi  state  or  an  arc  instance.  The  core  component  includes  the  control  driver  and  its  auxiliary 
modules.  Since  this  component  is  ATN  program  invariant,  it  is  stored  in  a  separate  file  and 
loaded  into  the  compiled  ATN  machine  when  the  ATN  module  component  starts  loading.  In 
this  section,  we  will  examine  the  skeletons  of  the  core  component  and  the  prototypes  of  ATN 
state  and  arc  modules  after  we  briefly  discuss  an  issue  about  the  control  of  search  order. 

7.3.1  Handling  of  side-effects  and  semi-flexible  depth-first  search 

We  have  pointed  out  that,  in  the  Kwasny-Shapiro  interpreter,  some  SNePS  structures 
may  be  created  while  a  process  is  in  execution,  and  that  such  a  creation  is  a  side-effect  In 
order  to  handle  the  side-effects  inexpensively,  the  interpreter  searches  the  ATN  space  in  a 
depth-first  fashion.  We  have  imposed  a  non-trivial  constraint  in  the  compiled  machine  that 
restricts  the  degree  of  flexibility  in  the  search  order  to  a  certain  extent  In  our  implementation, 
we  assume  that  the  most  recently  created  process’s  successors  get  the  highest  search  priority 
and  that  the  search  order  is  flexible  among  the  child  processes  of  the  most  recently  expanded 
processes,  rather  than  among  all  the  processes  spread  over  the  entire  search  frontier  in  the  ATN 
net  This  restriction  appears  to  result  in  depth-first  search.  However,  this  is  different  from  the 
blind  depth-first  control  thaat  the  compiled  ATN  machines  of  the  host-recursion  model  per¬ 
form.  We  call  the  depth-first  control  of  our  kind,  in  which  flexibility  is  allowed  within  each 
slate,  remi -flexible  depth-first  Clearly,  semi-flexible  depth-first  control,  which  is  a  kind  of 
“hill-climbing  strategy”  (Winston  (1984:93)],  is  mid-way  between  blind  depth-first  search  and 
fully  flexible  search,  and  is  still  better  than  blind  depth-first  The  benefit  of  this  constraint  is 
that  the  side-effects  caused  by  an  arc  module  can  be  undone  by  its  parent  state  module  when 
the  parent  module  regains  control  either  to  invoke  another  arc  module  or  to  find  that  there  is 
no  more  left  However,  by  implementing  a  different  procedure  for  handling  side-effects,  one 
can  certainly  build  an  ATN  machine  of  this  model  to  be  fully  flexible.’’  Thus,  this 

M.irtms's  (198.1)  implementation  of  multiple  belief  spaces  in  SNePS.  in  whkh  the  interpret.ition  of 
SNePS  structures  are  context-dependent,  suggests  one  possible  solution  to  this  problem. 
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implemeniational  constraint  should  not  be  construed  as  a  theoretical  weakness  of  the  model 
itself. 

73.2  Control  driver 

The  top-level  control  driver  of  the  production-system  ATN  machine  is  the  function 
PARSE  as  shown  in  Figure  4. 

Figure  4.  The  top-level  control  driver 
*»«****«*?r***  ABOUT  HERE  **************” 

PARSE  takes  a  sequence  of  pre-processed  input^  and  an  atom  for  the  name  of  the  initial  state 
as  its  arguments.  It  then  initializes  the  process  queue  QCFGS  with  the  given  input  and  the  ini¬ 
tial  state,  and  begins  to  crank  the  infinite  loop  of  executing  the  frontmost  process  taken  from 
the  queue  and  updating  the  queue.  As  Figure  5  illustrates,  EVAL-PC^’  invokes  the  appropri¬ 
ate  ATN  sute  module  with  the  process’s  configuration  frame  CFG  as  its  argument. 

Figure  5.  EVAL-PC,  a  module  for 
a  function  application 

»*»*»***«******  ABOUT  HERE  *»»»«»»«»*«»* 

7.3.3  State  modules 

In  this  section,  we  give  an  example  of  compilation  of  an  ATN  state.^^  A  fragment  of  an 
actual  ATN  program  as  given  in  Figure  6  yields  a  compiled  ATN  module  as  shown  in  Figure  7. 
When  entered,  a  state  module  first  removes  any  side-effects  left  by  the  last-executed  child  arc 
alternative  before  it  invokes  the  appropriate  arc  module  pointed  to  by  the  entry  point.  Because 
of  our  policy  of  having  the  side-effects  of  the  previous  process  removed  by  the  process  called 
for  the  next  sibling  process,  we  have  implemented  such  that  a  state  module  is  re-entered  once 

The  input  pre-precoRsing  includes  actions  such  as  dictionary  lookup,  decapitalization,  and  punctu.-ition 
handling. 

EVAL-PC  as  well  is  PARSE,  is  one  of  the  best  checkpoints  to  be  monitored  by  ATN  debuggers. 

-■  An  ATN  source  state  <state>  is  mapped  to  a  compiled  ATN  sute  module  with  the  name  CA%<state>. 


2B-120 


Figure  6.  A  fragment  of  an  ATN  program 
•*•***»*»*»****  ABOUT  HERB  ************** 


Figure  7.  Compiled  ATN  state  module 
corresponding  to  state  SP 

««««,,««,  about  here  **************** 

more  after  all  of  its  arc  alternatives  have  been  consumed,  just  for  the  purpose  of  removing  the 
side-effects  left  by  the  last  arc  alternative.^^  If  an  entry  value  leads  to  a-  mis-firing  of  an  arc 
module  due  to,  a  rejection  of  the  arc  module’s  entrance  test,  the  control  may  completely  exit 
the  state  module  and  then  re-enter  the  state  module  for  the  next  arc  alternative.  However,  in 
such  a  case  in  the  present  implementation,  instead  of  exiting  vacuously,  the  state  module  keeps 
invoking  different  arc  modules,  altering  the  entry  values^"*  until  it  finds  the  first  successful 
one  in  order  to  cut  do\xn  the  frequency  of  re-entrance  to  a  state  module.  An  arc  module 
returns  a  record  with  two  information  fields:  a  set  of  successor  archi processes  and  a  set  of 
SNePS  nodes  created  by  the  arc  module  as  its  side  effect  However,  if  the  entrance  to  the  arc  is 
rejected  by  the  ATN's  entrance  test,  a  mere  NIL  report  instead  of  a  record  of  two  empty  sets, 
is  returned  to  the  state  module.  This  informative  signaling  method  makes  it  possible  for  a 
state  module  to  differentiate  the  case  of  an  immediate  test  entrance  failure  from  that  of  a 
blocked  ATN  search  in  the  search  space  but  leaving  no  side-effects  and  successor  processes.  This 
distinction  is  utilized  by  pseudo-sute  modules  due  to  GROUP  arcs  in  determining  when  the 
rest  arc  alternatives  should  be  aborted.  A  state  module  returns  to  the  top-level  driver  a  set  of 
processes  (including  possibly  an  empty  set)  which  is  the  union  of  the  newly  expanded  succes¬ 
sor  processes  and  the  process  itself  to  be  re-entered. 

7.3.4  Arc  modules 


And.  this  Ls  also  a  good  time  to  give  the  user  the  message,  if  desired,  that  the  coMrol  exits  the  sute. 

-■*  Presently,  the  function  NEXT  EN£N  that  finds  the  next  entry  value,  is  a  nacro  replaced  by  Lisp's 
AODl  functioiT.  But.  this  macro  is  subject  to  be  refined  into  any  appropriate  operatios  to  meet  the  particul.ir 
need  of  the  individual  installation  site  depending  on  their  policy  over  search  order. 
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Arc  modules^^  in  the  compiled  machine  are  closely  related  to  the  interpreter’s  arc  func¬ 
tions.  However,  while  the  interpreter’s  arc  functions  are  generic,  the  compiled  machine’s  arc 
modules  are  specific  to  a  state.  A  generic  arc  function  takes,  as  its  arguments,  a  specific  ATN 
source  of  the  arc  body  to  be  interpreted  and  a  specific  configuration  frame  of  the  current  state, 
and  interprets  the  arc  body  within  the  eval-frame  as  indicated  by  the  configuration  frame  to 
realize  all  the  tests  and  actions  to  be  taken  by  the  arc.  But  arc  modules  in  the  compiled 
machine  are  each  bound  to  a  state  with  some  variable  bindings  and  form  expansions  done  dur¬ 
ing  compile  time.  Therefore,  a  compiled  arc  module  realizes  the  class  of  specific  instances  of  an 
arc  in  a  particular  state,  and  is  invoked  in  order  to  complete  the  last  phase  bindings  and  form 
evaluations  that  had  to  be  delayed  until  parse  time.  Arc  modules  that  realize  an  insunce  of 
CAT,  TST,  WRD,  VMR,  TO,  or  JUMP  are  straightforward  in  the  sense  that  they  are  largely  a 
mere  transformation  of  the  respective  generic  arc  function  filtered  through  the  first-phase 
evaluation  by  the  compiler.  But  the  arc  modules  for  the  GROUP  arc  and  push-family  arcs 
deserve  some  comments  on  their  somewhat  tricky  implemenutional  tactics. 

We  pointed  out  that,  in  the  interpreter  machine  (cf.  Section  6.3),  a  list  of  the  push-pop 
actions  are  stored  in  successor  process’s  configuration  frame  in  their  ATN  source  notation  so 
that  it  can  be  later  interpreted.  In  the  compiled  machine  (cf.  Section  7.2),  the  push-pop  actions 
are  defined  into  a  unique  lambda  function  on  the  parser  control’s  entrance  to  the  associated 
PUSH  process,  and  its  address  (i.e.,  the  function  name)  is  stored  in  the  configuration  frame  so 
that  it  can  be  later  invoked  by  the  right  POP  process. 

A  GROUP  arc  makes  itself  as  if  it  were  a  state.  A  set  of  arcs  emanate  from  a  GROUP  arc 
very  much  like  from  a  state.  But  the  first  positive  result  of  the  entrance  test  to  any  of  the 
alternative  arcs  within  a  GROUP  aborts  all  the  other  alternatives.  We  will  call  the  imaginary 

TTw  .trc  modules  dependent  on  G^‘S><stace>  are  named  CAftcstate><k><arcname  k>  where 
<arcname-k>  is  the  k  th  arc  emanating  from  the  state  <sute>  in  the  source  program.  This  naming  scheme 
systematically  assigns  every  compiled  ATN  module  a  mnemonic  name.  However,  there  is  a  slim  chance  of 
having  a  collision  between  a  state  module  and  an  arc  module.  For  instance,  if  there  is  a  state  named  NP  with 
Its,  say,  first  arc  being  CAT,  and  if  there  is  another  state  named  NPICAT,  then  the  name  CAVcNPICAT  will 
be  assigned  both  to  the  module  for  the  state  NPlCAT  and  to  the  module  for  the  first  arc  CAT  of  the  state  NP. 

Collision  of  even  this  kind  can  be  avoided  by  maintaining  a  ubie  of  module  names,  and  letting  it  be  checked 
before  every  new  module  is  to  be  named,  which,  we  believe  however,  will  be  a  wMe  of  resources  with  no 
practical  danger  existing. 
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Slate  created  by  a  GROUP  arc  a  pseudo  stale.  A  GROUP  arc  module^*"  is  by  itself  a  pseudo- 
state  module.  It  invokes  its  own  arc  modules  with  a  control  slightly  different  from  that  of 
normal  state  modules.  Apparently,  one  of  the  strong  motivations  of  having  GROUP  arc  con¬ 
vention  in  ATN  notation  is  to  allow  ATN  programmers  to  express  their  special  desire  for  a 
sequential  control  over  otherwise  non-deterministic  alternatives.  Thus,  we  also  design  GROUP 
arc  modules  such  a  way  that  the  control  flows  sequentially  just  as  indicated  by  the  ATN  pro¬ 
grammer.  For  this  reason,  a  pseudo-state  made  by  a  GROUP  arc  is  an  OR -construction  to 
which  the  concept  of  re-entrant  state  module  is  not  applied.  Figure  8  exemplifies  a  fragment 
of  an  ATN  program  with  a  GROUP  arc  used  in  it,  and  Figure  9  illustrates  how  the  GROUP  arc 
is  mapped  to  a  pseudo-state  module  in  a  compiled  ATN  system. 

Figure  8.  A  fragment  of  an  .ATN  program  with  a  GROUP  arc 
«»»»»»*»»»«»»  HFRF  *»**»»*«»*»**•** 


Figure  9.  Compiled  ATN  pseudo-state  module 

corresponding  to  pseudo-state  GROUP  arc  of  state  NP 

t*»»n***t=*»***  about  here  *»»»**»»»***»*«» 

7.4  Parse  time  control 

A  clear  advantage  of  the  production-system  model  is  that  a  vast  amount  of  system  con¬ 
trol  can  be  exercised  at  parse  time.  With  an  appropriate  refinement  of  the  algorithm  for  the 
entry  point  value  computation  at  each  entrance  to  a  state  module,  the  entirely  different  ATN 
search  can  be  explored.  Even  without  a  sophisticated  control  strategy  algorithmically  set  up, 
the  user  can  experiment  with  different  search  controls  manually  using  the  convenient  and 
powerful  monitoring  devices  that  can  be  easily  inserted  to  the  system  because  of  the 
production-system  architecture.  The  places  that  such  monitoring  devices  can  best  fit  are  the 
two  control  driver  modules,  PARSE  and  EVAL-PC.  PARSE  is  the  module  that  drives  the 

The  pwudo  state  of  a  GROUP  arc  is  represented  by  a  *•"  concatenated  at  the  end  of  the  name  of  the 
GROUP  arc  module  such  as  f:A‘?o<state><k>GROUP*.  and  the  embedded  arcs  from  the  pseudosute  are 
further  named  conststentlv  m  accordance  with  the  above  rule  for  arc  module  naming. 
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production-system,  and  EVAL-PC  is  the  module  that  fires  the  current  processes,  namely,  each 
selected  state  module  for  every  production-system  cycle.  For  instance,  by  setting  the  global 
flag  PARSE-MONITOR  on,  the  user  enters  a  break  package  every  time  the  search  driver  iterates 
the  production-system  cycle  as  was  illustrated  in  Figure  4.  In  this  break  package,  the  user  can 
easily  examine  or  manipulate  the  content  of  the  process  queue  (QCFGS)  in  order  to  trace  or 
modify  the  search  order.^^  Similarly,  by  setting  the  global  flag  EVAL-PC-MONITOR  on,  the 
user  enters  a  break  package  every  time  the  search  driver  is  about  to  execute  the  process  selected 
from  the  prpe^  queue  as  was  illustrated  in  Figure  5.  The  user  can  also  enter  the  break  loop 
when  the  control  is  about  to  invoke  certain  selected  state  modules  only,  by  maintaining  a  glo¬ 
bal  list  BREAK-POINTS  of  state  module  names  which  is  checked  with  for  a  break  in  EVAL- 
PC  as  shown  in  Figure  5.  In  the  break  package  entered  from  EVAP-PC,  the  user  can  examine 
or  manipulate  the  contend  of  the  current  process  (CFG).  TTie  user  can  freely  and  safely  re¬ 
arrange  the  order  in  which  the  state-dependent  alternative  arc  modules  are  executed  b-.  merely 
altering  the  entry  point  value  stored  in  the  process’s  configuration  frame.  Figure  10  shows  an 
example  of  a  user  exercising  this  control  facility. 

Figure  10.  A  parse  time  trace  of  the  control  driver 
monitored  by  the  user 

ti**»*»n*******  about  hfrf  *»**»**»***»*»*» 


7.5  Results 

Figure  11  ill'istrates  an  efficiency  comparison  between  the  two  ATN  machines,  the  inter¬ 
preter  and  the  compiled  machines.^* 

Figure  11.  Efficiency  comparison  between  interpreted  and 
compiled  ATN  machines  with  Shapiro  (1985)  ATN  program 


’’  However,  shuffling  of  the  processes  in  this  queue  should  be  done  with  a  great  amount  of  caution  excr 
cised  if  the  site-dependent  policy  of  side-effects  remov.il  imposes  a  certain  limitation. 

We  m.idt  this  comparison  with  all  the  other  factors  remaining  precisely  the  same.  The  two  Lisp  pro 
grams  were  both  Lesp  compiled  for  their  maximum  speed,  and  the  test  run  was  made  when  there  was  no  oth 
er  users  logged  in  on  the  system. 
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The  ATN  program  used  was  the  grammar  for  natural  language  processing  by  Shapiro  (1985). 
Some  inputs,  prefixed  by  an  “S",  were  English  sentences  which  caused  the  system  to  respoad  in 
English  through  the  whole  chain  of  processings  for  surface  sentence  understanding,  inference, 
and  surface  sentence  generation,  while  some  others,  prefixed  with  “G”,  were  SNePS  strt  -turcs 
which  caused  the  system  to  generate  surface  English  sentences  expressions  for  them.  The  input 
forms  were  given  to  the  system  in  the  order  as  they  are  presented  in  the  figure,  w'hich  caused 
the  system  to  construct  and  maintain  a  particular  cognitive  context  With  these  non-trivial 
data,  the  compiled  machine  was  found  to  be  running  about  3  times  as  fast  as  the  interpreter 
counterpart.  Considering  the  factor  that  this  comparison  was  made  in  SNePS  context  where 
many  SNePS  activities  ether  than  ATN  parsing  per  se,  such  as  deductions,  were  also  taking 
place  embedded  in  parsing  routines,  we  reasonably  conclude  that  the  actual  efficiency  improve¬ 
ment  should  be  far  more  than  this  figure.  Through  further  innumerable  successive  tests  car¬ 
ried  out  in  SNePS  Group  Ijb.  we  were  convinced  that  the  presented  design  of  our  compiled 
machine  contained  no  apparent  faults. 

8.  CONCLUSION 

Compiled  ATN  machines  of  the  production-system  model  constructed  along  the  incremen¬ 
tal  compilation  approach  are  important.  We  have  demonstrated  that  the  design  technique  of 
re-entrant  state  module,  in  which  an  ATN  state  module  is  built  so  as  to  contain  as  many  entry 
points  as  the  number  of  the  arcs  in  it,  i.''  a  valid  solution  to  the  problem  of  building  a  compiled 
production-system  ATN  machine  that  allows  flexibie  search  control.  This  design  provides 
enough  motivation  to  go  funher  into  arc-module  modularization  beyond  the  traditional  state- 
module  modularization,  thus  extending  the  scope  of  incremental  compilation  of  ATN  programs. 
Our  satisfactory  implementation  has  proven  that  this  design  is  practically  viable,  too,  by 
enhancing  the  efficiency  of  ATN  program  execution  in  terms  of  the  speed  as  much  as  three 
times  cornered  with  the  interpreter  machine  in  the  same  environment.  The  practical  benefit 
of  flexible  search  control  an  ATN  machine  of  this  design  has  hinges  upon  the  effectiveness  of 
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the  algorithm  used  to  calculate  the  value  for  the  entry-point  on  each  entrance  to  '  state 
module.  The  study  of  this  issue  remains  open. 
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Figure  1.  Input/outpui  relations  of 
various  ATN-related  machines 
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a  re-entrant  state  module 
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(DEF  OCMPAKC 

(lJMay>i  (OPORT  SSID  AQlUNn  .ARC) 

;;  OPORT  is  the  compiler  output  file 

SSID  is  the  name  of  the  state  which  the  arc  is  dependent  on 
i^COLNT  counts  the  static  order  of  the  arc  in  the  state 
ARC  is  the  source  ATN  definition  of  the  arc 
(LET  ((AMIIULE-Ni“ME  (f^WCE-AVflXJLE-NMC  SSID  AOCHKT  ARC) ) ) 

(GQND  (dS-VALID-ARC  ARC) 

(SETQ  CA-ARCS  (CONS  AMDULE-NAVC  CA-ARCS)) 

;  a  global  variable,  CA-ARCS  remembers  all  the  arc  modules  generated 
(FLNCALL  (GET-OCMPARC- SPECIALIST  ARC) 

OPORT 

AMCOJLE-NAIVE 

ARC)) 

(T  (PRINT-CODE  CPORT  ••;;**ERROR  IN  ARC  FLNCTION**") 
(MSG  N  "UNKNCWN  ARC  "  (ARC-NAVE  ARC)  "  IN  "  SSID) 
NIL))))) 

Figure  3.  Lisp  definition  of  CCXIPARC  function 


PROCEDURE  CTMPARC  (OUn'lLl-;  STATE-NANE;  ARC-0(XNT;  ARC-WDY); 

{ 

generate  the  name  of  the  compiled  arc  module; 

IF  no  error  in  ARC-WDY 
THEN 

call  arc-specialist  to  generate  code  for  the  compile  arc  modu 
ELSE 

give  message  for  an  error  in  ARC-BCDY  of  ATATE-NAVE; 
write  in  OLTFILE  an  error- indi cat ing  code; 


Figure  3.  Pseudo-code  description  of  CCMPARC 
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(DEF  PARSE 

(L^MBD\  (PREP -SENTENCE  INIT-STATE) 

;;  PREP-SENTENCE  is  the  pre-processed  input  sentence 
INIT-STATE  is  the  name  of  the  ATN  state  to  start  from 
(DO  (((5CPGS  (INITIALIZE- POO  PREP-SENTENCE  INIT-STATE)) 

(PARSE- SET  NIL)) 

;  PARSE-SET  is  non- local ly  modified  by  successful  parsings 
(dSNLLL.POO  QCPGS)  PARSE-SET) 

;  if  the  queue  is  empty,  returns  PARSE-SET 
(BREAK  PARSE- BREAK  (AND  PARSE-NCNITDR  (EXAM-CJUEUE))) 

;  if  PARSE -MDN lie®  flag  is  on. 

displays  the  process  queue,  and  enters  the  break  package 
(SEID  CPFGS 

(MI3WE-PO0  (EVAL-PC  ( FIRST- PC! POQ  (^TOS) ) 

(REST-PCQ  CCH3S))) 

;  the  queue  is  updated  with  the  result  of  the  current  process 
(IF  (AND  ONE-PARSEFLG  PARSE- SET) ( SEH)  CPPGS  NIL-PCQ))))) 

Figure  4.  Top-level  control  driver 


(E£F  EVAL-PC 
(LAMBDA  (CFG) 

;;  CFG  is  the  configuration  frame  of  the  current  process 
(BREAK  EVAL-PC- BREAK  (AND  EVAL-PC-MDNITOR  (EXAM- PC))) 

;  if  EVAL-PC-NDNITOR  flag  is  on. 

;  displays  the  config.  frame,  and  enters  the  break  package 
(•BREAK  (IS-BREAKPOINT-SET  (GET- STATE -MUJLE- NAIVE  CFG)  BREAK  POINTS) 
(CONS  ’break-at -bpoint  (GET- STATE -NOXILE- NAME  CFG) ) ) 

;  enters  the  break  package 

;  if  the  state  module  name  is  listed  in  BREAK-POINTS 
(FONCALL  (GET-STATE-NOXLE-NAIVE  CFG)  CFG))) 

Figure  5.  EVAL-PC,  a  module  for 
a  function  application 
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(SP  :  parse  a  sentence 
(\VRD  (WCVHAT)  T 

;  if  it  starts  with  '\vho"  or  "what",  it’s  a  question. 
(SETR  TYPE  ’0)  (LIFTR  TYPE)  (SETR  SEBJ  95X)  (TO  V)) 

(PUSH  NPP  T 

;  a  statement  starts  with  a  noun  phrase  —  its  subject. 
(SENDR  TYPE  ’D)  (SENDR  P-ATT)  (SENDR  NH2) 

(SENDR  BELIEVER. RBG) 

(SETR  TYPE  (OOND  ((NULLR  STYPE)  ’D) 

(T  ’(D  S)))) 

(LIFTR  TYPE)  (SETR  SUBJ  •) 

-  .(TO  V))) 


Figure  6.  A  fragment  of  an  ATN  program 

(Note  that  there  are  tw)  arc  alternatives  in  state  SP) 


(DEF  CAyflSP 
(LAMBTYV  (CFG) 

(CU-AN-S1DIT£ITS  (GET-SIDEEFFBCTS  CPC3))  • 

:  First,  the  sideeffects  left  by  the  previous  entrance  are  undone 
(ID  (:  IX)- loop  keeps  the  control  to  stay  in  the  state  nv  Jale 
:  until  the  first  successful  arc  alternative  is  fjjnd. 
(ENTRY  (GET- EN! PC  CTO)) 

:  E^'fRY  is  the  value  of  the  entry-point 
(NDS+SONS)) 

:  the  value  returned  from  the  successor  arc  module 
(NIL  ’ INTINITE-LOOP) 

(SLTO  NDS+SONS 

(CASDO  ENTRY 

(1  (CmSPlVRD  CFG)) 

(2  (CmSPlPUSH  CFG)) 

(T  (RETURN  NIL-PCQ)))) 

(SEIQ  ENTRY  (NDCT-EN.EN  ENTRY)) 

(aiND  (NDS+SON 

(RETURN  (MERGE-POO-PCQ&PC 

(SUOCESS<»-PCQ  NDS+SCNS) 

(UPDATE-CURRENT- PC 
CFG 

(GET-S1I*EFFBCTS  MK+SONS) 

(NEXT-EN.EN  ENTRY))))) 

(T  ’DO-UOCPING))))) 

Figure  7.  Compiled  ATN  state  module  corresponding  to  state  SP 
(Note  that  the  compiled  state  module  CASfaSP  for  state  SP 
contains  in  it  two  non-trivial  entry-points) 
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(NP 

(GR(XIP  :  first  see  if  there’s  a  propositional  attitude 
(PUSH  G  (OR  (GETA  VHICH)  (GETA  VERB)) 

;  generate  an  S  to  express  the  M-objective 
:  of  a  propositional  attitude  if  there  is  one. 

;  if  the  CfiJ-register  contains  a  propositional  node. 

;  then  that  node  has  either  a  vdiich*arc  or  a  verb-arc 
(SENDR  DONE)  (SEM*  DIMHV  (RTPRINT  ’THAT))  (TO  IJO)) 

;else:  the  proper  number  is  pl  for  a  class,  sing  for  an  individual 
(JIM>  NPl  NUVBR) 

(JIMP  NPl  (OR  (GETA  SUB-)  (GETA  SUP-)  (GETA  OASS-)) 

■  (SETR  NUVBR  ’PL)) 

(JLM>  NPl  (NCTT  (OR  (GETA  SUB-)(GETA  SUP-)(GErA  OASS-))) 

(SETR  NLMBR  ’SING)))) 


Figure  8.  A  fra^nent  of  an  ATN  program  with  a  group  arc 
(Note  that  the  GROUP  arc  groups  the  four  sub-arcs  in  it 
into  a  sequence  of  nutually  exclusive  alternatives.) 


(DEI-  (AWPIGRCUP 
(lAVlBIA  (CFG) 

(CLEAN- SIDliEFFS  (NDSJPC  CR3)) 
(C*  (CAWPIGROUPMPUSH  CFG) 
(CA9'«NP1GR0UP*2J(MP  CPG) 
((ASasTlGROUP’SJCMP  CFG) 
(CA9!iNPlGROUP*4JLMP  CR3)))) 


Figure  9.  Compiled  ATN  pseudo-state  module 

corresponding  to  pseudo-state  group  arc  of  state  .NP 
(Note  that  the  four  sub-arc  modules  are  combined 
into  a  sequential  OR-construction. 
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(  PARSE-VTNITOR  IS  ON  ] 

:  Young  Lucy  petted  a  yellow  dog 
Process  Queue 

1 :  (1  nil  (young  &  Lucy  &  .  . . )  (]j^os  . . . ) 

Break  parse-breake 
<1>:  (return  t) 

Process  (^eue 

1  :  (1  nil  ( young  &  Luc  y  &  . .  .  )  CA^os  p  . .  . ) 

2;  (2  nil  (young  &  Lucy  &....)  CA9I>s  . . . ) 

Break  pa rse -br cake 

<1>;  (setq  parse-monitor  nil  eva 1 -pc -moni tor  t) 
t 

[  NOV.  PARSE-VCNITOR  IS  OFF.  AND  EVAL-PC-M>J1T0R  IS  CW  ] 

<1>:  (return  t) 

(ronf  igurat  ion  Frame  for  (CASljsp  .  l) 

(entry  =  1 ) 

(nodeset  -  NIL) 

(buffer  -  (young  (&)  Lucy  (&)  ...)) 

(state  "  CASlfip) 

(pstack  -  ((C:^  &  CA91>slpush-pop2))) 

(level  -  2) 

(regs  -  ((believer. reg  .  Cassie))) 

(held  -  NIL) 

(path  -  NIL) 

(weight  »  0) 

Break  eva 1 -pc-break 

<1>:  (setq  eva 1 -pc -mon i tor  nil) 

n  i  I 

[  NCW.  EVAL-PC-.VONITOR  IS  ALSO  OFF  ] 

<1>:  (set-bpoint  npdet) 

A  break-point  made  at  state -module  G^enpdet 

[  NOV.  A  BREAK  POINT  IS  SET  TO  THE  STATE  NCDULE  FC«  STATE  npdet  ] 

C(^onpde  t 

<1>:  (return  t) 

Break  (break-at-bpoint  .  G^enpdet) 

<1>;  (exam-queue) 

Process  Queue 

1 :  (1  nil  (young  «&  Lucy  &  . . . )  CA^npdet  . . . ) 

2;  (5  ni  1  (young  &  Lucy  &  .  . .)  CA^oipp  . . .) 

3:  (3  ni  1  (young  &  Lucy  &  . . . )  CA7oclause  .  .  . ) 

4:  (11  nil  (young  &  Lucy  &  . . .  )  GASbrulep  .  . . ) 

5;  (2  ni 1  (young  &  Lucy  &  . . . )  CA%rpl  . . . ) 

6:  O  -ni  1  (young  &  Lucy  &  . . . )  CASlsp  . . . ) 

7 ;  ( 2  n  i  1  ( young  &  Luc  y  &  . . . )  CASis  .  . . ) 

t 

<1>:  (return  t) 
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Break  ( break-at -bpoini  .  G^<iipdet) 

<1>:  (exam-queue) 

Process  Queue 

1 :  (1  nil  (Lucy  &  petted  &  . . . )  CA%npdet  . . . ) 

2:  (2  nil  (young  &  Lucy  &  .  . . )  C^«npdet  . . . ) 

3 ;  ( 5  n  i  1  ( young  &  Lucy  &  .  . . )  Cji^dipp  . . . ) 

4:  (3  ni  1  (young  &  Lucy  &  . . .  )  CAdlclause  . .  .  ) 

5:  (11  nil  (young  &  Lucy  &  .  . .  )  G4%rulep  . . . ) 

6:  (2  nil  (young  &  Lucy  &  . . .  )  O^orpl  . . . ) 

7 :  (2  nil  (young  &  Lucy  &  .  . . )  G^osp  . .  .  ) 

8 ;  ( 2  n  i  1  ( young  &  Lucy  &  . . . )  . .  . ) 

t 

<1>:  (exam-pc) 

Configuration  Frame  for  (G^cnpdet  .  l) 

(entry  -  l) 

(nodeset  -  NIL) 

(buffer  -  (Lucy  (&)  petted  (&  A)  ...)) 

(state  -  CA7<npdet) 

(pstack  »  ((C^^lause  A  CA9'oclause2push-popl )  (CA91>rulep  A  CIA0i>ruleplOpush-pop2)  ...)) 
( I  eve  I  -  5) 

(regs  »  ((*  .  young)  (•  .  young)  (lex  .  young)  (*  .  young)  ...)) 

(held  «  ((adj  5  A  nil))) 

(path  »  NIL) 

(weight  •  0) 

t 

<1>:  ( reset-entry  6) 
entry  value  is  set  to  6 
6 

[  NCW,  EmUY  VALUF.  IS  AI.THRFD  FRCM  1  TT)  6  ] 

<1>;  (exam-pc) 

Configuration  Frame  for  (CASinpdet  .  6) 

(entry  -  6) 

(nodeset  =  NIL) 

(buffer  »  (Lucy  (A)  petted  (A  A)  ...)) 

(state  -  CA9bnpdet) 

(pstack  -  ( (C/^oclause  A  CA^lause2push-pop]  )  (O^orul  ep  A  CA9bruleplOpush-pop2)  ...)) 
( leve 1  -  5) 

(regs  =  ((*  .  young)  (*  .  young)  (lex  .  young)  (*  .  young)  ...)) 

(held  «  ((adj  5  A  nil))) 

(path  =  NIL) 

(we  i  ght  =  0) 
t 

<1>:  (unset -bpoint  npdet) 

The  break-point  removed  from  state-module  0\9'erpdet 
CAStnpde  t 

[  ■mE  BREAK  POINT  AT  STATE  NCDULE  CASinpdet  IS  REMOVED  ] 

<1>:  (return  t) 

I  understand  that  young  Lucy  petted  a  yellow  dog 


(time  (sec.).  :  ->  (epu-  5.733  gc-  0.00)  <=) 


Figure  10.  A  parse  time  trace  of  the  control  driver 
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Parse  tasks 


Usage  of  sjrobols  for  task  identification 

S#  (Surface  to  surface  parsing)  : 

input  sentence  — >  system’s  response 
G*  (Semantic  structure  to  surface  parsing)  : 
»  system’s  expression 


SI:  young  Lucy  petted  a  yellow  dog 

— >  I  understand  that  young  Lucy  petted  a  yellow  dog 
G1 ;  »  young  Lucy  will  pet  a  yellow  dog 
G2:  »  young  Lucy  is  starting  to  pet  a  yellow  dog 
G3:  »  young  laicy  is  petting  a  yellow  dog 
G4:  »  young  Lucy  just  petted  a  yellow  dog 
G5:  >>  young  Lucy  petted  a  yellow  dog 
G6:  »  young  Lucy  wi  1 1  be  petting  a  yellow  dog 
G7 :  »  young  Lucy  was  petting  a  yellow  dog 
G8:  »  young  Lucy  would  pet  a  yellow  dog 
G9:  >>  young  I.ucy  wi  1 1  have  petted  a  yellow  dog 
S2:  what  is  yellow— >  a  dog  is  vellow 

S3:  dogs  are  animals  — >  I  understand  that  dogs  are  animals 
S4:  who  petted  an  animal  — >  young  Lucy  petted  a  yellow  animal 
S5:  Lucy  is  a  girl 

— >  1  understand  that  sweet  young  Lucy  is  a  girl 
S6:  girls  are  people  —>  1  understand  that  girls  are  people 
S7 :  what  was  petted  by  a  person 

— >  a  yellow  animal  was  petted  by  sweet  young  Lucy 
S8:  who  petted  a  dog 

— >  sweet  young  Lucy  petted  a  yellow  animal 
GIO:  »  a  yellow  animal 

S9:  who  petted  Rover  — >  sweet  young  Lucy  petted  yellow  Rover 


task 

BESSuSSlHi 

1 

compi 1 ed 

SI 

16.03 

2.93 

G5 

5.45 

1.73 

S2 

13.65 

4.53 

S7 

30.66 

10.18 

G1 

5.81 

1.81 

G6 

7.50 

2.38 

S3 

4.71 

1.53 

S8 

30.73 

10.21 

G2 

5.06 

1 .58 

G7 

6.93 

2.16 

S4 

30.06 

9.93 

GIO 

2.58 

0.88 

G3 

6.51 

2.05 

G8 

13.43 

4.28 

S5 

7.18 

2.26 

S9 

19.96 

6.65 

G4 

5.13 

1.65 

G9 

11.95 

3.91 

S6 

4.68 

1 .50 

Ratio 

3: 160  ; 

1 

(or  1 

0.316) 

Figure  11.  Efficiency  comparison  between  interpreted  and 
compiled  ATN  machines  with  Shapiro  (1985)  ATN  program 
(time  unit  -  CPU  seconds) 
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Appendix  B4: 


"SNePS  Considered  as  a  Fully  Intensional 
Propositional  Semantic  Network" 


Shapiro,  S.C. 
Rapaport,  RJ. 
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11.1.  Introduction 


This  chapter  presents  a  formal  syntax  and  semantics  for 
SNePS,  the  Semantic  A/etwork  Processing  System  (Shapiro. 
1979b). The  syntax  shows  the  emphasis  placed  on  SNePSs 
propositional  nature.  The  semantics,  which  is  based  on  Alexius 
Meinong’s  theory  of  intentional  objects  (the  objects  of  thought), 
makes  SNePS's  fnUy  intensional  nature  precise;  as  a  fully 
intensional  theory,  it  avoids  possible  worlds  and  is  appropriate 
for  AI  considered  as  "computational  philosophy"  -  A1  as  the 
study  of  how  intelligence  is  possible  -  or  "computational 
psychology"  -  Al  with  the  goal  of  writing  programs  as  models 
of  human  cognitive  behavior.  We  also  present  a  number  of 
recent  Al  research  and  applications  projects  that  use  SNePS. 
concentrating  on  one  of  these,  a  use  of  SNePS  to  model  (or 
construct)  the  mind  of  a  cognitive  agent,  referred  to  as  CASSIE 
(the  Cognitive  Agent  of  the  SNePS  System-an  /ntelligent 
£ntity). 


11.1.1.  The  SNePS  environment 

A  semantic  network  is  a  data  structure  typically 
consisting  of  labeled  nodes  and  labeled,  directed  arcs.  SNePS 
can  be  viewed  as  a  semantic  network  language  with  facilities 

for 

building  semantic  nelw'orks  to  represent  virtually 


'  '7  r.:s  research  wis  supported  ir,  part  b}'  t.he  Na!:o.eal  Science  Foundation 
uniier  Grant  No.  IST-8504713  and  SUNY  Buffalo  Research  Development  Fund 
grants  No.  150-9216-F  and  No.  150-8537-G  (Rapaport).  and  in  part  by  the  Air 
Force  Systems  Command,  Rome  Air  Development  Center,  Griffiss  Air  F'  rce 
Base.  NY  13441-5700,  and  the  Air  Force  Office  of  Scientific  Research.  Bolling 
AFB,  DC  20332  under  contract  No.  F30602-85-C-0008  (Shapiro).  We  wish  to 
thank  Michael  Almeida,  James  Gellcr,  Joao  Marlins,  Jeannette  Neal.  Sargur 
N.  Srihari.  Jennifer  Suchin,  and  Zhigang  Xiang  for  supplying  us  with 
descriptions  of  their  projects,  and  Randall  R.  Dipert.  the  members  of  SNeRG 
'th'  SNePS  Research  Group),  and  three  anonymous  reviewers  for  comments  and 
r*. . 
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any  kind  of  information  or  knowl«dge. 

2.  retrieving  information  from  them,  and 

3.  performing  inference  with  them,  using  SNIP  (the 
5NePS  /nference  /^ckage)  and  path-based  inference. 

Users  cam  interact  with  SNePS  in  a  vau'iety  of  interface 
languages,  including:  SNePSUL.  a  USP-like  SNePS  User 

Language:  SENECA,  a  menu-based,  screen-oriented  editor; 
GINSENG,  a  ^•^phics-oriented  editor;  SNePSLOG.  a  higherTorder- 
logic  language  Xin  the  sense  in  which  PROLOG  is  a  first-order- 
logic  language)  (McKay  and  Martins.  1981).  (Shapiro.  McKay. 
Martins,  and  Morgado.  1981);  and  an  extendible  fragment  of 
English,  using  an  ATN  parsing  and  generating  grammar 
(Shapiro,  1982),  see  Figure  11-1. 


SNePS 

SNIP 

4  4 


SNePSUL 


SENECA 


T 

SNePSLOG 


GINSENG 


ATN 


Figure  II- 1;  SNePS.  SNIP  and  Their  User  Interfaces. 

SNePS.  SNTP  (the  SNePS  Interface  Package)  and  their  user 
interfaces.  When  the  arcs,  case  frames,  and  ATN  grammar  arc 
those  of  SNeP^CASSIE,  then  tlie  system  is  being  used  to 
model  CASSIE.  When  the  arcs  are  for  the  database  «,see 
Section  11.4.1),  then  the  system  is  being  used  as  a  database 
management  system,  etc. 
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SNePS  is  the  descendeni  of  SAMENLAQ  (Shapiro. 
Woodmansee.  and  Kreuger.  1968)  (Shapiro  and  Woodmansee. 
1969)  and  MENTAL  (Shapiro.  1971b).  (Shapiro.  1971c).  It  was 
developed  with  the  help  of  the  SNePS  Research  Group  at 
Indiana  University  and  at  the  University  at  Buffalo.  The 
current  version  is  implemented  in  Franz  LISP  and  runs  on 
VAX  ll/750s  and  780s  in  the  Department  of  Computer  Science 
at  Buffalo.  An  earlier  version  was  implemented  in  ALISP  on  a 
CDC  Cyber  730:  and  an  updated  version  is  being  implemented 
in  Common  LISP  on  Symbolics  LISP  machines,  TI  Explorers, 
and  a.  Tektronix  4406.  There  are  additional  installations  at 
other  universities  in  the  U.S.  and  Europe. 


11.1.2.  SNePS  as  a  knowledge  representation  system 

Some  researchers,  for  example.  (Levesque  and  Brachman, 
1985).  view  a  knowledge  representation  (KR)  system  ds  a 
subsystem  that  manages  the  knowledge  base  of  a  knowledge- 
based  system  by  storing  information  and  answering  questions. 
In  contrast,  we  view  SNePS  as  the  entire  knowledge-based 
system,  interacting  with  a  user/interlocuior  through  one  of  its 
interfaces.  Of  course,  the  user/interlocutor  could  be  another 
computer  program  using  SNePS  as  a  subsystem,  but  that  is  not 
the  way  we  use  it. 

A  basic  design  goal  of  SNePS  and  its  ancestors  was  to  be 
an  environment  within  which  KR  experiments  could  be 
performed,  that  is.  to  be  a  semantic  network  at  the  "logical" 
level,  to  use  Brachman’s  term  (Brachman.  1979).  see  Section 
11.5.  below.  This  has  been  effected  by  providing  a  rather  low 
level  interface.  SNePSUL.  Using  SNePSUL.  a  KR  designer  can 
specify  a  syntax:  individual  arc  labels  and  sets  of  arc  labels 
(or  case  frames)  that  will  be  used  to  represent  various  objects 
and  information  about  them.  It  is  also  the  designer’s  obligation 
to  supply  a  semantics  for  these  case  frames.  As  is  the  case 
for  any  provider  of  a  language  or  "shell",  we  cannot  be 
responsible  for  what  use  others  make  of  the  facilities  we 
provide.  .Nevertheless,  we  have  our  own  preferred  use. 

In  this  chapter,  we  try  to  do  two  things.  First,  we  try 
to  provide  an  understanding  of  SNePS  and  of  some  of  the  uses 
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to  v/hich  ii  has  been  put.  Second,  and  most  imporlantly.  we 

present  our  own  preferred  use:  this  is  to  use  SNePS.  with  a 

particular  set  of  arc  labels  and  case  frames,  and  a  particular 
parsing/generating  grammar  for  a  fragment  of  English,  as  (a 
model  of)  the  cognitive  agent,  CASSIE.  We  shall  refer  to 
SNePS  with  these  arcs,  case  frames,  and  grammar  as 
SNePS/CASSIE.  SNePS/CASSIE  forms  CASSIEs  mind  and 
stands  as  our  current  theory  of  KR  at  the  "conceptual"  level 
(cf.  Section  11,5.  below,  and  (Brachman,  1979)).  The  purpose 

of  the  central  part  of  this  paper  is  to  present  this  theory  by 
explaining  the  entities  represented  by  structures  in 

SNePS/CASSIE.  by  giving  a  formal  syntax  and  semantics  for 
those  structures,  and  by  showing  and  explaining  a  sample 
conversation  with  CASSIE. 


11.1^.  Informal  description  of  SNePS 

Regardless  of  the  intentions  of  a  KR-system  designer, 
SNePS.  as  a  KR  formalism,  provides  certain  facilities  and  has 
certain  restrictions.  The  facilities  (for  example,  for  building, 
finding,  and  deducing  nodes)  are  best  understood  as  those 
provided  by  SNePSUL.  but  we  shall  not  give  a  complete 
description  of  SNePSUL  here.  {For  an  example,  cf.  Section 
11.4.1.  below:  for  details,  see  (Shapiro,  1979b).]  The 
restrictions,  however,  axe  important  to  understand,  because  they 
distinguish  SNePS  from  a  general  labelled,  directed  graph  and 
from  many  other  semantic  network  formalisms. 

SNePS  is  a  propositional  semantic  network.  By  this  is 
meant  that  all  information,  including  propositions,  "facts",  etc., 
is  represented  by  nodes.  The  benefit  of  representing 
propositions  by  nodes  is  that  propositions  about  propositions  can 
be  represented  with  no  limit.  (In  the  formal  syntax  and 
semantics  given  in  Section  11.3.  the  propositions  are  the  nodes 
labelled  'm'  or  >*.) 

Arcs  merely  form  the  underlying  syntactic  structure  of 
SNePS.  This  is  embodied  in  the  restriction  that  one  cannot  add 
an  arc  between  two  existing  nodes.  That  would  be  tantamount 
to  telling  SNePS  a  proposition  that  is  not  represented  as  a 
node.  There  are  a  few  built-in  arc  labels,  used  mostly  for 
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rule  nodes.  Paths  of  arcs  can  also  be  defined,  allowing  for 
path-based  inference,  including  properly  inheritance  within 
generalization  hierarchies  [see  Section  11.3.4,  below;  cf.  Shapiro 
(Shapiro.  1978).  (Srihari.  198 1).  and  (Tranch.  1982).]  All  other 
arc  labels  are  defined  by  the  user,  typically  at  the  beginning  of 
an  interaction  with  SNePS.  although  new  labels  can  be  defined 
at  any  time. 

For  purposes  of  reasoning,  propositions  that  are  asserted 
in  SNePS  must  be  distinguished  from  those  propositions  that 
are  merely  represented  in  SNePS  but  not  asserted.  This  could 
happen  in  the  case  of  a  proposition  embedded  in  another  (for 
example.  "Lucy  is  rich"  embedded  in  ."John  believes  that  Lucy 
is  rich").  SNePS  interprets  a  proposition  node  to  be  asserted  if 
and  only  if  it  has  no  arcs  pointing  to  it.^' 

Another  restriction  is  the  Uniqueness  Principle:  There  is  a 
one-to-one  correspondence  between  nodes  and  represented 
concepts.  This  principle  guarantees  that  nodes  will  be  shared 
whenever  possible  and  that  nodes  represent  intensional  objects.^^ 
We  next  consider  the  nature  of  these  objects. 

11.2.  Intensional  knowledge  representation 

SNePS  can  be  used  to  represent  propositions  about  entities 
in  the  world  having  properties  and  standing  in  relations. 
Roughly,  nodes  represent  the  propositions,  entities,  properties, 
and  relations,  while  the  arcs  represent  structural  links  between 
these. 

SNePS  nodes  might  represent  exiensional  entities.  Roughly, 
extensional  entities  are  those  whose  "identity  conditions"  (the 
conditions  for  deciding  when  "two"  of  them  are  really  the 
"same")  do  not  depend  on  their  manner  of  representation.  They 


■^^This  j5  not  really  a  restriction  of  SNePS,  but  of  SNIP  (the  SNePS  Inference 
Package)  and  path-based  inference. 

'^^In  (Maida  and  Shapiro,  1982)  this  name  was  given  to  only  half  of  the 
Uniqueness  Principle  as  stated  here:  "each  concept  represented  in  the  network  is 

represented  by  a  unique  node"  (page  2*>1). 
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may  be  characterized  as  those  entities  satisfying  the  following 
rough  principle: 


Two  extensional  entities  are  equivalent  (for  some  purpose) 
if  and  only  if  they  are  identical'^^ 

For  example,  the  following  are  extensional: 

the  Fregeon  referent  of  on  expression; 
physical  objects; 
sentences ; 
truth  va I ues ; 

mothemot i ca I  objects  such  os: 
sets, 

functions  defined  in  terms  of  their  input-output 
behavior  (that  is.  as  sets  of  ordered  poirs), 
n— place  relations  defined  in  terms  of  sets  of 
ordered  n— tuples. 

Although  SNePS  can  be  used  to  represent  extensional 
entities  in  the  world,  we  believe  that  it  must  represent 
inlensional  entities.  Roughly,  intensional  entities  are  those  whose 
identity  conditions  do  depend  on  their  manner  of  representation. 
They  are  those  entities  that  satisfy  the  following  rough 
principle: 


Two  intensional  entities  might  be  equivalent  (for  some) 
purpose  without  being  identical  (that  is,  they  might 
really  be  two,  not  one). 

Alternatively,  intensional  entities  may  be  characterized  as 
satisfying  the  following  five  criteria: 


1.  They  are  non-substitutible  in  referentially  opaque 
contexts. 

2.  They  can  be  indeterminate  with  respect  to  some 
properties. 

3.  They  need  not  exist. 


33 


that  is,  if  and  only  if  "they" 


are  really  one  entity,  not  two 
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4.  They  need  noi  be  possible. 

5.  They  can  be  distinguished  even  if  they  are 
necessarily  identical  (for  example,  the  stun  of  2  and 
2  and  the  sum  of  3  and  1  are  distinct  objects  of 
thought). 

For  example,  the  following  are  intensional: 

the  Fregean  sense  of  an  expression; 

concept  s ; 

propos i t i ons ; 

propert i es; 

algor i thma; 

objects  of  thought,  including: 

fictional  entities  (such  os  Sherlock  Holnes). 
non-exi stents  (such  os  the  golden  mountain), 
impossible  objects  (such  os  the  round  squore) 

Only  if  one  wants  to  represent  the  relations  between  a 
mind  and  the  world  would  SNePS  also  have  to  represent 
extensional  entities  [cf.  (Rapaport.  1976).  (Rapaport.  1978). 
(McCarthy.  1979)].  However,  if  SNePS  is  used  just  to  represent 
a  mind  -  that  is.  a  mind’s  model  of  the  world-then  it  does  not 
need  to  represent  any  extensional  objects.  SNePS  can  then  be 
used  either  to  model  the  mind  of  a  particular  cognitive  agent 
or  to  build  such  a  mind  -  that  is.  to  6c  a  cognitive  agent 
itself. 

There  have  been  a  number  of  arguments  presented  in 
both  the  AI  and  philosophical  literature  in  the  past  few  years 
for  the  need  for  intensional  entities.  (Castaneda,  1974).  (Woods. 
1975).  (Rapaport.  1976),  (Rapaport.  19S5a).  (Brachman.  1977). 
(Routley.  1979).  cf.  (Rapaport.  1984a).  (Parsons.  1980).  cf. 
(Rapaport.  1985b)).  Among  them,  the  following  considerations 
seem  to  us  to  be  especially  significant: 


Principle  of  Fine-Grained  Representation: 

The  objects  of  thought  (that  is.  intentional  objects) 
ore  intensional:  a  mind  can  have  two  or  more  objects  of 
thought  that  correspond  to  only  one  extensional  object. 

To  take  the  classic  example,  the  Morning  Star  and  the  Evening 
Star  might  be  distinct  objects  of  thought,  yet  there  is  only  one 
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extensional  object  (viz..  a  certain  astronomical  body) 
corresponding  to  them. 

Principle  of  Displacement: 

Cognitive  ogents  con  think  ond  tolk  obout  non— existents: 

o  mind  con  hove  on  object  of  thought  that  corresponds  to 

no  extensional  object. 

Again  to  take  several  classic  examples,  cognitive  agents  can 
think  and  talk  about  fictional  objects  such  as  Santa  Claus, 
possible  but  non-existing  objects  such  zis  a  golden  mountain, 
impossible  objects  such  as  a  round  square,  and  possible  but 
not-yet-proven-to-exist  objects  such  as  theoretical .  entities  (for 
example,  black  holes). 

If  nodes  only  represent  intensional  entities  (and 
extensional  entities'  are  not  represented  in  the  network),  how  do 
they  link  up  to  the  external,  extensional  world?  In 
SNePS/CASSIE.  the  answer  is  by  means  of  a  LEX  arc  (see 
syntactic  formation  rule  SR.l  and  semantic  interpretation  rule 
SI.l  in  Section  11.3.3.  below);  the  nodes  at  the  head  of  the 
LEX  arc  are  our  (the  user's)  interpretation  of  the  node  at  its 
tail.  The  network  without  the  LEX  arcs  and  their  head-nodes 
displays  the  structure  of  CASSIE’s  mind  [cf.  (Carnap.  1967), 
Section  11.14]. 

A  second  way  that  nodes  can  be  linked  to  the  world  is 
by  means  of  sensors  and  effectors,  either  linguistic  or  robotic. 
The  robotic  sort  has  been  discussed  in  (Maida  and  Shapiro. 
1952).  Since  so  many  A1  understanding  systems  deal 
exclusively  with  language,  here  we  consider  a  system  with  a 
keyboard  as  its  sense  organ  and  a  CRT  screen  as  its  only 
effector. 

Since  the  language  system  interacts  with  the  outside 
world  only  through  language,  the  only  questions  we  can 
consider  about  the  connections  of  its  concepts  with  reality  are 
questions  such  as: 

Does  it  use  words  os  we  do? 

When  it  uses  word  >v.  does  it  meon  the  same  thing  os 
when  I  use  it? 

When  I  use  word  w.  does  it  understand  whot  I  mean? 
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The  perceptual  system  of  the  language  system  is  its 
parser/analyzer  -  the  programs  that  analyze  typed  utterances 
and  build  pieces  of  semantic  network.  The  motor  system  is 
the  generator  -  the  programs  that  analyze  a  section  of  the 
semantic  network  and  construct  an  utterance  to  be  displayed  on 
the  CRT.  One  crucial  requirement  for  an  adequate  connection 
with  the  world  is  simple  consistency  of  input-output  behavior. 
That  is.  a  phrase  that  is  analyzed  to  refer  to  a  particular  node 
should  consistently  refer  to  that  node,  at  least  while  there  is 
no  change  in  the  network.  Similarly,  if  the  system  generates  a 
certain  phrase  to  describe  the  concept  represented  by  a  node,  it 
should  be  capable  of  generating  that  same  phrase  for  that  same 
node,  as  long  as  nothing  in  the  network  changes.  Notice  that 
it  is  unreasonable  to  require  that  if  a  phrase  is  generated  to 
describe  a  node,  the  analyzer  should  be  able  to  find  the  node 
from  the  phrase”  The  system  might  know  of  several  brown 
dogs  and  describe  one  as  "a  brown  dog  ;  it  could  not  be 
expected  to  find  that  node  as  the  representation  of  "a  brown 
dog"  consistently. 

If  we  are  assured  of  the  simple  input-output  consistency 
of  the  system,  the  main  question  left  is  whether  it  uses  words 
to  mean  the  same  thing  as  we  do.  It  is  the  same  question 
that  we  would  be  concerned  with  if  we  were  talking  with  a 
blind  invalid,  although  in  that  case  we  would  assume  the 
answer  was  'Yes'  until  the  conversation  grew  so  bizzare  that 
we  were  forced  to  change  our  minds.  As  the  system  (or  the 
invalid)  uttered  more  and  more  sentences  using  a  particular 
word  or  phrase,  we  would  become  more  and  more  convinced 
that  it  meant  what  we  would  mean  by  it.  or  that  it  meant 
what  we  might  have  described  with  a  different  word  or  phrase 
("Oh!  When  you  say  'conceptual  dependency  structure’,  you 
mean  what  I  mean  when  I  say  'semantic  network'."),  or  else 
that  we  didn't  know  what  was  meant,  or  that  it  was  not 
using  it  in  a  consistent,  meaningful  way  (and  hence  that  the 
system  (or  invalid)  did  not  know  what  it  was  Ulking  about). 
As  long  as  the  conversation  proceeds  without  our  getting  into 
the  latter  situation,  the  system  has  all  the  connections  with 
reality  it  needs. 
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11.3.  Description  of  SNePS/CASSEE 

In  this  section,  we  introduce  CASSIE.  and  give  the  syntax 
and  semantics  for  SNePS/CASSIE  in  terms  of  a  philosophical 
theory  of  mental  entities  inspired  by  Alexius  Meinong's  Theory 
of  Objects. 


113.1.  CASSIE  -  A  model  of  a  mind 

SNePS  nodes  represent  the  objects  of  CASSIE’s  thoughts  - 
the  things  she  thinks  about,  the  properties  and  relations  with 
which  she  characterizes  them,  her  beliefs,  her  judgments,  etc. 
[cf.  (Maida  and  Shapiro.  1982),  (Rapaport.  19S5a)].  According 
to  the  Principle  of  Displacement,  a  cognitive  agent  is  able  to 
think  about  virtually  anything,  including  fictional  objects, 
possible  but  non-existing  objects,  and  impossible  objects.  Any 
theory  that  would  account  for  this  fact  requires  a  non-standard 
logic,  and  its  semantics  cannot  be  limited  to  merely  possible 
worlds.  (Otherv/ise,  it  could  not  account  for  impossible 
objects.  This  accounts  for  the  difficulties  David  Israel  has  in 
providing  a  possible-worlds  semantics  for  SNePS  (Israel,  1983). 
(cf.  (Rapaport.  1985a)).  Theories  based  on  the  Theory  of 
Objects  of  the  tum-of-the-century  Austrian  philosopher- 
psychologist  Alexius  Meinong  are  of  precisely  this  kind. 

For  present  purposes,  it  will  be  enough  to  say  that 
Meinong  held  that  psychological  experiences  consist  in  part  of  a 
psychological  act  (such  as  thinking,  believing,  judging,  wishing, 
etc.)  and  the  object  to  which  the  act  is  directed  (for  example, 
the  object  that  is  thought  about  or  the  proposition  that  is 
believed).  T'vi^  kinds  of  Meinongian  objects  of  thought  are 
relevant  for  us: 


1.  The  objectwn.  or  object  of  "simple"  thoughts;  Santa 
Qaus  is  the  objeclum  of  John’s  act  of  thinking  of 
Santa  Claus.  Objecta  are  the  meanings  of  noun 
phrases. 

2.  The  objective,  or  object  of  belief,  knowledge,  etc.: 
that  Santa  Claus  is  thin  is  the  objective  of  John  s 
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act  of  believing  that  Santa  Claus  is  thin.  Objectives 
are  like  propositions  in  that  they  are  the  meanings 
of  sentences  and  other  sentential  structures. 

It  is  important  to  note  that  objecta  need  not  exist  and  that 
objectives  need  not  be  true.  [For  details,  see:  (Meinong.  1904), 
(Findlay.  1963).  (Rapapori.  1976).  (Rapaport.  1978).  (Rapaport. 
1981).  (Rapaport.  1982).  (Castaneda,  1974).  (Castaneda.  1975a). 
(Castaneda.  1975b),  (Casteneda.  1975).  (Castaneda.  1977), 
(Castaneda.  1979),  (Tomberlin.  1984),  and  (Routley.  1979);  cf. 
(Rapaport.  1984a).  (Parsons.  1980):  cf.  (Rapaport,  1985b), 
(Lambert.  1983);  cf.  (Rapaport.  1986a),  (Zalu.  1983).] 

This  is,  perhaps,  somewhat  arcane  terminology  for  what 
might  seem  like  AI  common  sense.  But  without  an  underlying 
theory,  such  as  Meinong's.  there  is  no  way  to  be  sure  if 
common  sense  can  be  trusted.  It  is  important  to  note  that  not 
only  are  all  represented  things  intentional,  but  that  they  are 
all  objects  of  CASSIE's  mental  acts;  that  is.  they  are  all  in 
CASSIE's  mind  (her  "belief  space")  -  they  are  all  intentional. 
Thus,  even  if  CASSIE  represents  the  beliefs  of  someone  else 
(for  example.  John's  belief  that  Lucy  is  rich,  as  in  the 
conversation  in  the  next  section),  the  objects  that  she  represents 
as  being  in  that  person's  mind  (as  being  in  his  "belief  space") 
are  actually  CASSIE's  representations  of  those  objects  -  that  is. 
they  are  in  CASSIE's  mind. 


113.2.  A  conversation  with  CASSIE 

Before  giving  the  syntax  and  semantics  of  the  case,  frames 
employed  in  representing  CASSIE's  "mind",  we  present  a 
conversation  we  had  with  her.  An  ATN  parser/generator 
(Shapiro.  1982)  was  used  to  parse  the  English  input  into  SNePS 
and  to  generate  the  SNePS  structures  in  English.  User  input  is 
on  the  lines  beginning  with  the  :-prompt;  CASSIE’s  output  is 
on  the  lines  that  follow.  (Execution  and  garbage-collection 
times  have  been  edited  out  for  clarity.) 

:  Young  Lucy  potted  a  yellow  dog 

I  understood  that  young  Lucy  petted  o  yellow  dog 

:  Whot  is  yellow 
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o  dog  is  yellow 

:  Dogs  ore  on  into  Is 
I  understood  thot  dogs  ore  onimols 

:  Who  petted  on  onimol 

young  Lucy  petted  o  yellow  dog 

:  Lucy  is  sweet 

I  understood  thot  young  Lucy  is  sweet 
Lucy  is  0  girl 

I  understand  that  sweet  young  Lucy  is  o  girl 

:  Girls  ore  people 
I  understood  that  girls  ore  people 

:  Whot  was  petted  by  a  person 
o  yellow  dog  wos  petted  by  sweet  young  Lucy 

;  The  dog  is  named  Rover 
J  understand  that  Rover  is  o  yellow  dog 

:  Whot  wos  petted  by  a  person 

yellow  Rover  was  petted  by  sweet  young  Lucy 

:  John  believes  that  Lucy  is  rich 
I  understand  that  John  believes  that  Lucy  is  rich 

:  Who  is  rich 
I  don't  know. 

:  Who  is  sweet 
young  Lucy  is  sweet 

:  John  believes  thot  Lucy  is  old 

I  understand  that  John  believes  thot  rich  Lucy  is  old 
:  John  i s  o  boy 

1  understand  thot  John  is  a  boy 

:  Boys  ore  people 
I  understand  that  boys  ore  people 

:  Dogs  ore  pets 

I  understand  thot  dogs  ore  pets 

:  For  every  p  ond  d  if  p  is  o  person  and  d  is  o  pet  then  p 
loves  d 

I  understond  that  for  every  d  ond  p,  if  p  is  o  person 
and 

d  is  o  pet 


2B-150 


then  p  loves  d 


I 

j  :  Who  loves  o  pet 

'  sweet  young  Lucy  loves  yellow  Rover 

ond 

\  John  loves  yellow  Rover 


1133.  Syntax  and  semantics  of  SNePS/CASSEE 

In  this  section,  we  give  the  syntactic  formation  rules  (SR) 
and  semantic  interpretations  (SI)  for  the  nodes  and  arcs  used  in 
this  interaction,  together  with  some  other  important  ones.  We 
return  to  a  more  detailed  examination  of  the  interaction  in  the 
next  section.  What  we  present  here  is  our  current  model:  we 
make  no  claims  about  the  completeness  of  the  representational 
scheme.  In  particular,  we  leave  for  another  paper  a  discussion 
of  such  structured  individuals  as  the  golden  mountain  or  the 
round  square,  which  raise  difficult  and  important  problems 
with  predication  and  existence.  (For  a  discussion  of  these 
issues,  see  (Rapaport.  1978).  (Rapaport.  1985a).] 

Information  is  represented  in  SNePS  by  means  of  nodes 
and  arcs.  Since  the  meaning  of  a  node  is  determined  by  what 
it  is  connected  to  in  the  network,  there  are  no  isolated  nodes. 
Nodes  that  only  have  arcs  pointing  to  them  are  considered  to 
be  unstructured  or  atomic.  They  include: 

(Al)  sensory  nodes,  which  represent  interfaces  with  the 

external  world  (in  the  exoroples  thot  follow,  they  will 
represent  words,  sounds,  or  utterances); 

(A2)  base  nodes,  which  represent  constont  individuol 
concepts  ond  properties; 

(A3)  variable  nodes,  which  represent  orbitrory  individuals 
(cf.  (Fine,  1983))  or  orbitrory  propositions. 

Molecular  nodes,  which  have  arcs  emanating  from  them,  include: 

(Ml)  structured  individucd.  nodes,  which  represent  structured 
individual  concepts  or  properties  (that  is.  concepts 
and  properties  represented  in  such  o  woy  that  their 
internol  structure  is  exhibited;  see  the  discussion 
of  structured  information  in  (Woods.  1975)); 

(M2)  Structured  proposition  nodes,  which  represent 
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propoa i t i ons ;  thoae  with  no  incoming  area  repreaent 
beliefs  of  the  ayatem.^^  (Note  that  atructured 
propoaition  nodea  con  oiao  he  conaidered  to  be 
atructured  indi viduala. }  Propoaition  nodea  ore  either 
atomic  (representing  atomic  propositions)  or  ore 
ride  nodes.  Rule  nodea  represent  deduction  rules 
ond  ore  used  by  SNIP  (the  SNePS  Inference  Pockoge)  tor 

node— bosed  deductive  inference.^^ 

For  each  of  the  three  categories  of  molecular  nodes  (structured 
individuals,  atomic  propositions,  and  rules),  there  are  constant 
nodes  of  that  category  and  pattern  nodes  of  that  category 
representing  arbitrary  entities  of  that  category. 

The  rules  labeled  ‘SR*,  below,  should  be  considered  as 
syntactic  formation  rules  for  a  non-linear  network  language. 
The  semantic  interpretations,  labeled  ‘ST.  are  in  terms  of 
Meinongian  objecta  and  objectives,  which  are  intentional  objects, 
that  is.  objects  of  thought.  Since  intentional  objects  are 
intensional.  our  Meinongian  semantics  is  an  extensional 
semantics  oyer  a  domain  of  intensional  entities  (Meinongian 
objects). 

We  begin  with  a  few  definitions.^^ 

Definition  1 

A  node  dominates  another  node  if  there  is  a  path  of  directed 


34 

There  is  a  need  to  distinguish  structured  proposition  nodes  with  no 
incoming  arcs  from  structured  individual  nodes  ivith  no  incoming  arcs;  the 
latter,  of  course,  are  not  beliefs  of  the  system.  This  is  handled  by  the 
syntactic  formation  rules  and  their  semantic  interpretations.  There  is  also  a  need 
to  distinguish  bet^'een  beliefs  of  the  system  and  those  propositions  that  the 
system  is  merely  contemplating  or  "assuming"  temporarily  (cf.  (Meinong.  1983)). 
We  are  currently  adding  this  capability  to  SNePS  by  means  of  an  assertion 
operator  (’!’). 

^^For  details,  see  (Shapiro,  1977),  (Shapiro,  1978),  (McKay  and  Shapiro. 
1980),  (McCarty  and  Sridharan,  1981),  (Shapiro  and  McKay,  1980),  (Shapiro, 
Marlins,  and  McKay,  1982),  (Martins,  1983a). 


■*^050  are  actually  only  rough  definitions;  the  interested  reader  is  referred 
to  (Shapiro,  1979b),  Section  2.1,  for  more  precise  ones. 
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arcs  from  ihe  first  node  to  the  second  node. 

Definition  2 

A  pattern  node  is  a  node  that  dominates  a  variable  node. 
Definition  3 

An  irulividucd.  node  is  either  a  base  node,  a  variable  node,  or  a 
structured  constant  or  pattern  individual  node. 

Definition  4 

A  proposition  node  is  either  a  structured  proposition  node  or  an 
atomic  variable  node  representing  an  arbitrary  proposition. 

SR.l  If  "w"  is  an  English  word  and  "i"  is  an  identifier  not 
previously  used,  then 

0 

LEX 

0 

is  a  network,  w  is  a  sensory  node,  and  i  is  a  structured 
individual  node. 

SI.l  i  is  the  Meinongian  objectum  corresponding  to  the 
utterance  of  w. 

SIl.2  If  either  "tj"  and  ^2"  identifiers  not  previously  used, 
or  "tj"  is  an  identifier  not  previously  used  and  I2  is  a  temporal 
node,  then 
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BEFORE 


is  a  network  and  tj  and  t2  are  lemporal  nodes,  that  is 
individual  nodes  representing  times. 

SI.2  tj  and  t2  are  Meinongian  objecta  corresponding  to  two 
time  intervals,  the  former  occurring  before  the  latter. 

SR.3  If  :  and  j  are  individual  nodes,  and  "m"  is  an  identifier 
not  previously  used,  then 


is  a  network  and  m  is  a  structured  proposition  node. 

SI3  m  is  the  Meinongian  objective  corresponding  to  the 
proposition  that  Meinongian  objecta  i  and  j  (are  believed  by 
CASSIE  to)  correspond  to  the  same  actual  object.  (This  is  not 
used  in  the  conversation,  but  is  needed  for  fully  intensional 
represenuiional  systems:  cf.  (Rapaport.  1978:RAPA84b)  and 
(Castaneda.  1974:CAST75b)  for  analyses  of  this  sort  of 
relation,  and  (Maida  and  Shapiro.  1982)  for  a  discussion  of  its 
use.) 

SR,4  If  i  and  j  are  individual  nodes  and  "m"  is  an  identifier 
not  previously  used,  then 
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is  a  nelwork  and  m  is  a  structured  proposition  node. 

SI.4  m  is  the  Meinongian  objective  corresponding  to  the 
proposition  that  i  has  the  property  J. 

SR^  If  i  and  J  are  individual  nodes  and  "m"  is  an  identifier 
not  previously  used,  then 


is  a  network  and  m  is  a  structured  proposition  node. 

SI-5  m  is  the  Meinongian  objective  corresponding  to  the 
proposition  that  Meinongian  objectum  i’s  proper  name  is  j.  {j  is 
the  .Meinongian  objectum  that  is  i's  proper  name:  its  expression 
in  English  is  represented  by  a  node  at  the  head  of  a  LEX-arc 
emanating  from  y.) 

SR.6  If  i  and  j  are  individual  nodes  and  "m"  is  an  identifier 
not  previously  used,  then 
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MEMBER 


.CLASS 


IS  a  network  and  m  is  a  structured  proposition  node. 

SI. 6  m  is  the  Meinongian  objective  corresponding  to  the 
proposition  that  t  is  a  (member  of  class)  j. 

SR. 7  If  i  and  j  are  individual  nodes  and  "m"  is  an  identifier 
not  previously  used,  then 


is  a  network  and  m  is  a  structured  proposition  node. 

SI.7  m  is  the  Meinongian  objective  corresponding  to  the 
proposition  that  (the  class  of)  is  are  (a  subclass  of  the  class 
of)  js. 

SR.8  If  ij,  i2,  ij  are  individual  nodes,  tj,  are  temporal  nodes, 
and  "m'  is  an  identifier  not  previously  used,  then 
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is  a  network  and  m  is  a  structured  proposition  node. 

SI.8  m  is  the  Meinongian  objective  corresponding  to  the 
proposition  that  agent  performs  act  '■O  ^5  starting  at 

time  Cj  and  ending  at  time  ’^'here  (j  is  before  t2- 


It  should  be  noted  that  the  ETIME  and  STIME  arcs  are 
optional  and  can  be  part  of  any  proposition  node.  They  are  a 
provisional  technique  for  handling  the  representation  of  acts 
and  events:  our  current  research  on  temporal  representation  is 
much  more  complex  and  is  discussed  in  Section  11.4.7.  below. 

SR.9  If  nij  is  a  proposition  node,  i  is  an  individual  node,  j  is 
the  (structured  individual)  node  with  a  LEX  arc  to  the  node, 
believe,  and  "m^"  is  an  identifier  not  previously  used,  then 
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is  a  network  and  is  a  structured  proposition  node. 

SI.9  is  the  Meinongian  objective  corresponding  to  the 

proposition  that  agent  i  believes  proposition  rrij. 


Two  special  cases  of  SR-9  that  are  of  interest  concern  de  re 
and  de  dicto  beliefs:  they  are  illustrated  in  Figure  11-2  and 
Figure  11-3.  [For  details,  see  (Rapaport  and  Shapiro. 
1984)  and  (Rapaport.  1984b),  (Rapaport,  1986b).] 

SR.IO  If  nij,  ...  ,  are  proposition  nodes  (n  ^  0).  V  and  "/ 
are  integers  between  0  and  n.  inclusive,  and  "r"  is  an  identifier 
not  previously  used,  then 


is  a  network,  and  r  is  a  rule  node. 

SI.10  r  is  the  Meinongian  objective  corresponding  to  the 
proposition  that  there  is  a  relevant  connection  between 
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AGENT. 


OBJECT 


Figure  11-2:  Meinongian  Objective  -  de  re  Reading 

is  the  Meinongian  objective  corresponding  to  the 
proposition  that  agent  ij  believes  de  re  of  objectum  ij  (who 
is  believed  by  CASSIE  to  be  named  iJ)  that  it  has  the 
property  ij. 

propositions  . such  that  at  least  i  and  at  most  iCj)  of 

them  are  simultanenously  true. 


Rule  r  of  SR/Sl.lO  is  called  AND-OR  and  is  a  unified 
generalization  of  negation  (i  =  J  =  0),  binary  conjunction  (i  =  j 
=  2).  binary  inclusive  disjunction  (i  =  1.  j  =  2).  binary 
exclusive  disjunction  (i  =  0.  y  =  l),  etc. 

SR.ll  If  ruj . are  proposition  nodes  (n  <  0),  is  an 

integer  between  0  and  n.  inclusive,  and  V"  is  an  identifier  not 
previously  used,  then 
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is  the  Meinongian  objective  corresponding  to  the 
proposition  that  agent  ij  believes  de  dicto  that  objectum  tj 
(who  is  believed  by  to  be  named  ij)  has  the  property  iy 


is  a  network,  and  r  is  a  rule  node. 

SI.ll  r  is  the  Meinongian  objective  corresponding  to  the 
proposition  that  there  is  a  relevant  connection  between 

propositions  . such  that  either  fewer  than  i  of  them 

are  true  or  they  all  are  true. 

Rule  r  of  SR/SI.ll  is  called  THRESH  and  is  a  generalization  of 
the  material  biconditional  (i  -  1). 
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SR-12  If  aj . a^,  Cj . Cj,  and  d j . are  proposition 

nodes  fn  ^  /;  j,  k  ^  0;  j  ■¥  k  ^  1).  V  is  an  integer 
between  1  and  n.  inclusive,  and  V"  is  an  identifier  not 
previously  used,  then 


is  a  network,  and  r  is  a  rule  node. 

SI.12  r  is  the  Meinongian  objective  corresponding  to  the 
proposition  that  the  conjunction  of  any  i  of  the  propositions  a^, 
...  .  relevantly  implies  each  (]  ^  Z  ^  j)  and  relevantly 
implies  each  d^  (J  ^  I  <  k)  for  which  there  is  not  a  better 
reason  to  believe  it  is  false. 

SR.13  If  Cj . a^,  Cj,  ...  ,  Cj,  and  dj . d^^  are  proposition 

nodes  (n,  j,  k  ^  0),  and  "r"  is  an  identifier  not  previously 
used,  then 


is  a  network,  and  r  is  a  rule  node. 

SI.  13  r  is  the  Meinongian  objective  corresponding  to  the 
proposition  that  the  conjunction  of  the  propositions  Oj,  ...  , 
relevantly  implies  each  (J  ^  I  ^  j)  and  relevantly  implies 
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each  d.  (1  <  I  <  k)  for  which  there  is  not  a  belter  reason  to 
believe  it  is  false. 

The  d.  are  default  consequences,  in  the  sense  that  each  is 
implied  only  if  it  is  neither  the  case  that  CASSIE  already 
believes  not  d^  nor  that  not  follows  from  non-default  rules. 

SR. 14  If  Cj . a^,  Cj,  ...  ,  Cj,  and  dj . are  proposition 

nodes  {n  ^  1:  j,  k  ^  O,  j  +  k  ^  1),  and  V“  is  an  identifier 
not  previously  used,  then 


is  a  network,  and  r  is  a  rule  node. 

SI.  14  r  is  the  Meinongian  objective  corresponding  to  the 
proposition  that  any  a.,  J  ^  i  ^  n.  relevantly  implies  each 
(1  ^  I  ^  J)  and  relevantly  implies  each  d^  (I  ^  I  ^  k)  for 
which  there  is  not  a  better  reason  to  believe  it  is  false. 

SR.  15  If  m  is  a  proposition  node,  and  V"  is  an  identifier  not 
previously  used,  then 

o 

0 

is  a  network,  and  r  is  a  rule  node. 
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51.15  r  is  the  Meinongian  objeciive  corresponding  to  tbe 
proposition  that  there  is  no  good  reason  for  believing 
proposition  m. 

SR.16  If  r  is  3  rule  node  as  specified  by  SR.10-SR.15,  and  r 
dominates  variable  nodes  v,,  ...  .  v„.  and.  in  addition,  arcs 

I  n 

labeled  "AVB"  go  from  r  to  each  v^.,  then  r  is  a  quantified  rule 
node. 

51.16  r  is  the  Meinongian  objective  corresponding  to  the 
proposition  that  the  rule  that  would  be  expressed  by  r  without 
the  AVB  arcs  holds  after  replacing  each  v.  by  any  Meinongian 
object  in  its  range. 

SR.17  If  r  is  a  rule  node  as  specified  by  SR.10-SR.15,  and  r 
dominates  variable  nodes  v^,  ...  ,  v^.  and,  in  addition,  arcs 
labeled  "EVB"  go  from  r  to  each  v.,  then  r  is  a  quantified  rule 
node. 

51.17  r  is  the  Meinongian  objective  corresponding  to  tbe 
proposition  that  the  rule  that  would  be  expressed  by  r  without 
the  EVB  arcs  holds  after  replacing  each  v^.  by  some  Meinongian 
object  in  its  range. 

SR.18  If  Qj . and  c  are  proposition  nodes:  v j,  ...  ,  are 

variable  nodes  dominated  by  one  or  more  of  Qj,  ...  ,  a^,  c;  i  . 
“/•  and  "n“  are  integers  10  ^  i  ^  j  ^  mn)'.  and  V"  is  an 
identifier  not  previously  used;  then 
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IS  a  network,  and  r  Ls  a  rule  node. 


SI.18  r  is  the  Meinongian  objeciive  corresponding  to  the 
proposition  that,  of  the  n  sequences  of  Meinongian  objects 
which,  when  substituted  for  the  sequence  v^,  ...  ,  v^.  make  all 
the  a.  believed  propositions,  between  i  and  j  of  them  also 
satisfy  c.  (For  further  details  on  such  numerical  quantifiers,  see 
(Shapiro.  1979c).) 


113.4.  The  conversation  with  CASSIE,  revisited 

In  this  section,  we  shall  review  the  conversation  we  had 
with  CASSIE.  showing  the  network  structure  as  it  is  built  - 
that  IS.  showing  the  structure  of  CASSIE’s  mind  as  she  is  given 
information  and  as  she  infers  new  information.  (Comments  are 
preceded  by  a  dash.) 

:  Young  Lucy  petted  o  yellow  dog 
I  understand  that  young  Lucy  petted  o  yellow  dog 

—  CASSIE  is  told  something,  which  she  now  believes.  Her 
entire  belief  structure  is  shown  in  Figure  11-4  (o). 

The  node  lobeled  "now”  represents  the  current  time,  so 
the  petting  is  clearly  represented  os  being  in  the  post. 
CASSIE's  response  is  ”I  understond  thot"  oppended  to  her 
English  description  of  the  proposition  just  entered. 


:  Whot  is  yellow 
a  dog  is  yellow 

-  This  response  shows  thot  CASSIE  octuol ly  hos  some 
beliefs;  she  did  not  just  porrot  bock  the  above 
sentence . 

:  Dogs  are  animals 
I  understand  that  dogs  ore  onimols 

-  CASSIE  is  told  a  smoll  section  of  o  doss  hierorchy. 

:  Who  petted  on  on i mo  I 

young  Lucy  petted  a  yellow  dog 

-  CASSIE  con  answer  the  question  using  the  class 
hierorchy,  because,  prior  to  the  conversot ion. 
the  inheritonce  rule 


2B-164 


r>o« 


Fragment  of  CASSIE's  belief  structure  after  being  told 
that  young  Lucy  petted  a  yellow  dog. 


(def-poth  class  (compose  doss  (kstar 

(compose  subcloss—  superc I oss) ) ) ) 

wos  given  to  SNePS .  This  rule  says  that  the  CLASS  ore  is 
implied  by  the  poth  consisting  of  a  CLASS  ore  followed 
by  zero  or  more  occurrences  of  the  two— ore  poth 
consisting  of  the  converse  SUBCLASS  ore  followed  by  the 
SUPERCLASS  arc  [sec  (Shopiro.  1978).  (Srihori.  1981)]. 
The  dog  wos  colled  "o  yellow  dog"  rother  then  "o  yellow 
on i mo  I"  becouse  the  redundont  CLASS  ore  is  not  built. 
Figure  11-5  shows  the  current  stote  of 

CASSIE’s  belief  structure  about  the  dog’s  clossif icotion 
ond  color. 


:  Lucy  is  sweet 

I  understond  thot  young  Lucy  is  sweet 
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Figure  11-5:  CASSlE's  Belief  Structure 

CASSIE's  belief  structure  about  the  dog’s  classification 
and  color.  (Node  ml2  represents  the  dog.) 

-  CASSIE's  response  shows  thot  she  identifies  this 
Lucy  with  the  previous  Lucy. 

:  Lucy  is  a  girl 

I  understand  that  sweet  young  Lucy  is  o  girl 

-  The  beginning  of  o  class  hierorchy  for  Lucy.  Notice 
that  all  the  adjectival  properties  of  Lucy  ore  mentioned. 

:  Girls  ore  people 

1  understand  that  girls  are  people 

-  More  of  the  class  hierarchy  is  given. 

:  What  was  petted  by  a  person 

a  yellow  dog  was  petted  by  sweet  young  Lucy 

-  Again,  the  proposition  is  retrieved  using  the  CLASS 
inheritance  rule.  The  answer  is  expressed  in  the 
passive  voice  because  of  the  woy  the  question  was 
asked . 

:  The  dog  is  nomed  Rover 

I  understand  that  Rover  is  a  yellow  dog 
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-  'the  dog’  refers  to  the  only  dog  CASS  IE  knows  obout , 
who  is  now  given  a  name. 

:  Whot  was  petted  by  o  person 

ye  Mow  Rover  wos  petted  by  sweet  young  Lucy 

-  This  is  exactly  the  some  question  that  wos  osked 
before.  It  is  answered  differently  this  time, 
because  the  dog  now  has  o  nome.  ond  CASSIE  prefers 
to  describe  on  individual  by- its  nome  when  it  has 
one . 

;  John  believes  that  Lucy  is  rich 

1  understand  thot  John  believes  thot  Lucy  is  rich 

-  At  this  point  in  our  development  of  CASSIE,  she 
interprets  'believes  thot*  contexts  to  be  de 
dicto,  so  she  ossumes  that  the  Lucy  that  John 
has  in  mind  is  o  different  one  from  the  Lucy 
thot  she  Isnows.  Figure  11—6  shows  CASSIE’s 
beliefs  about  the  two  Lucies. 


:  Who  is  rich 

I  don't  know. 

-  CASSIE  knows  no  one  who  is  rich.  She  only  believes 
that  John  believes  that  someone  (whom  she  believes 
thot  he  believes  to  be  nomed  ’Lucy*)  is  rich.  The 
answer  is  *1  don’t  know*,  rother  thon  ’no  one  is  rich’ 
becouse  CASSIE  doesn’t  use  the  closed— world  hypothesis 

:  Who  is  sweet 

young  Lucy  is  sweet 

—  This  question  is  osked  merely  to  demonstrate  thot  Lucy 
is  able  to  answer  a  "who  is  <property>'’  question  when 
she  hos  relevant  beliefs. 

;  John  believes  that  Lucy  is  old 

1  understand  that  John  believes  thot  rich  Lucy  is  old 

—  Even  though  CASSIE  ossumes  thot  John  knows  o 
different  Lucy  than  she  knows,  she  assumes 
that  all  Jahn's  beliefs  about  "Lucy**  ore 
about  the  same  Lucy. 

:  John  is  a  boy 

I  understand  that  John  is  a  boy 

-  This  ond  the  next  two  inputs  are  given  to 
estoblish  more  of  the  doss  hierorchy  and 
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cejfCT 


5T  ll 


oajccT 


»  s  t 


IT  «  f 


Figiire  11-6:  A  Fragment  of  the  Network 

A  Fragment  of  the  network  after  CASSIE  is  told  that 
John  believes  that  Lucy  is  rich,  showing  CASSlE's  beliefs 
abc'Ut  tlie  two  Lucies. 


to  moke  it  door  that  whop  CASSIE  onswers 
the  last  question  of  this  session,  she  is 
doing  both  path-based  reosoning  ond  node- 
based  reasoning  at  the  some  time. 

I  understand  that  boys  are  people 

:  Dogs  ore  pets 

1  understand  that  dogs  are  pets 


:  For  every  p  and  d  if  p  is  a  person  ond  d  is  o  pet  then  p 
loves  d 


1  understand  that  for  every  d  and  p.  if  p  is  a  person  ond 
d  is  0  pet 

—  Figure  11—7  shows  how  this  node— based 

rule  fits  into  the  doss  hierarchy.  This  is,  we 
believe,  equivalent  to  the  integrated  TBox/ABox 
nechanlsm  proposed  for  KRYPTON 
[  (Brochman,  Fikes,  and  Levesque.  19B3), 

(Brachraan.  Gilbert,  and  Levesque.  1965)]. 


Figure  11-7:  A  Node-based  Rule  in  a  Class  Hierarchy 

:  Who  loves  o  pet 

sweet  young  lucy  loves  yellow  Rover 
and 
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John  loves  yellow  Rover 


-  The  question  was  answered  using  poth-bosed  inferencing 
to  deduce  thot  Lucy  and  John  ore  people  and  that  Rover 
is  o  pet.  and  node— based  inferencing  to  conclude  that, 
therefore,  .ucy  and  John  love  Rover. 

-  The  full  network  showing  CASSlE's  stote  of  mind  ot  the 
end  of  the  conversation  is  given  in  Figure  11-8. 


Figxire  11-8:  CASSlE's  Beliefs  al  the  End  of  ihe  Conversation 
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11.4.  Extensions  and  applications  of  SNePS 


In  this  essay,  we  have  been  advocating  the  tise  and 
interpretation  of  SaNePS  networks  to  model  (the  beliefs  of)  a 
cognitive  agent.  SNePS.  however,  is  of  much  wider  and  more 
general  applicability.  In  this  section,  we  give  examples  of 
recent  and  current  research  projects  using  SNePS  in  belief- 
revision.  as  a  database  management  system,  for  developing 
several  expert  systems.  and  for  representing  temporal 
information  in  narratives.  Even  though  most  of  these  uses  of 
SNePS  do  not  explicitly  involve  a  cognitive  agent,  it  should  be 
noted  that  in  each  case  the  asserted  nodes  can  be  treated  as 
"beliefs"  of  the  system:  beliefs  about  the  database,  beliefs  about 
the  various  domains  of  the  expert  systems,  beliefs  about 
linguistics,  etc. 


11.4.1.  SNePS  as  a  database  management  system 

SNePS  can  be  used  as  a  network  version  of  a  relational 
database  in  v/hich  every  element  of  the  relational  database  is 
represented  by  an  atomic  node,  each  row  of  each  relation  is 
represented  by  a  molecular  node,  and  each  column  label 
(attribute)  is  represented  by  an  arc  label.  Whenever  a  row  r 
has  an  element  c  in  column  c.  the  molecular  node  representing 
r  has  an  arc  labeled  c  pointing  to  the  atomic  node  representing 
e.  Relations  (tables)  may  be  distinguished  by  either  of  two 
techniques,  depending  on  the  particular  relations  and  attributes 
m  the  relational  database.  If  each  relation  has  an  attribute 
that  does  not  occur  in  any  other  relation,  then  the  presence  of 
an  arc  labeled  with  that  attribute  determines  the  relationship 
represented  by  the  molecular  node.  A  review  of  the  syntax  of 
the  CASSIE  networks  wfll  show  that  this  technique  is  used 
there.  The  other  technique  is  to  give  every  molecular  node  an 
additional  arc  (perhaps  labeled  "RELATION")  pointing  to  an 
atomic  node  whose  identifier  is  the  name  of  the  relation.  Table 
11-1  shows  the  Supplier-Part-Project  database  of  (Date.  1981). 
p  114).  Notice  that  the  SNAME  and  STATUS  attributes  only 
occur  in  the  SUPPLIER  relation;  PNAME.  COLOR,  and 
weight  only  occur  in  the  PART  relation;  JNAMF:  only  occurs 
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in  the  PROJECT  relation:  and  QT^'  only  occurs  in  the  SPJ 
relation.  Figure  11-9  shows  the  SNePS  network  for  part  of 
this  database. 


Table  1:  SUPPLIER 


Table  4:  SPJ 


s# 

SNAME 

STATUS 

CITY 

si 

Smith 

20 

London 

s2 

Jones 

10 

Paris 

s3 

Blake 

30 

Paris 

s4 

Qark 

20 

London 

s5 

Adams 

30 

Athens 

Table  2:  PART 


PNAME 

COLOR 

WHGHT 

cnr 

nut 

red 

12 

London 

bolt 

green 

17 

Paris 

screw 

blue 

17 

Rome 

screw 

red 

14 

London 

cam 

blue 

12 

Paris 

cog 

red 

19 

London 

Tables:  PROJECT 


JNAME 

CITY 

sorter 

Paris 

punch 

Rome 

reader 

Athens 

console 

Athens 

collator 

London 

terminal 

Oslo 

tape 

London 

s# 

P# 

si 

pi 

si 

pi 

s2 

p3 

s2 

p3 

s2 

p3 

s2 

p3 
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Table  11-1:  Tables  Supplier  Part  Project  and  SPJ 

Many  database  retrieval  requests  may  be  formulated  using 
the  find  command  of  SNePSUL.  the  SNePS  User’s  Language. 
The  syntax  of  find  is  (find  Tj  n^  —  r^  n^).  where  r^  is 
either  an  arc  or  a  path,  and  Oj  is  either  a  node  or  a  set  of 
nodes  (possibly  the  value  of  a  nested  call  to  find).  The  value 
of  a  call  to  find  is  the  set  of  all  nodes  in  the  network  with 
an  rj  arc  to  any  node  in  the  set  nj.  an  arc  to  any  node  in 
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Figure  11-9:  Fragment  of  SNePS  Network  for  the 
Supplier-Part-Project  Database. 


the  sei  n, . and  an  r„  arc  to  anv  node  in  the  set  n^.  Free 

variables  are  prefixed  by  *?".  An  infix  sign  between  finds 

represents  the  set  difference  operator. 

The  session  below  shows  some  of  the  queries  from  (Date. 
1981);  pp  141-142  translated  into  find  commands,  and  the 
results  on  the  database  shown  above.  (In  each  interaction, 
comments  are  preceded  by  semicolons,  user  input  follows  the 
’’’-prompt,  and  SNePS  responses  are  on  succeeding  lines. 
Execution  and  garbage  collection  times  have  been  edited  out  for 
clarity.) 

:  G«t  full  details  of  oil  projects  in  London. 

•  (dump  (find  jnome  ?x  city  London)) 

(m18  (city  (London))  (jnome  (tope))  (jnum  (j7))) 

(m16  (city  (London))  (jnome  (collotor))  (jnum  (j5))) 

(dumped) 

;  Get  SNUM  values  for  suppliers  who  supply  project  J1 

;  with  port  P1 . 

•  (find  snum-  (find  jnum  jl  pnum  pi)) 

(sl) 

;  Get  JNAWE  values  for  projects  supplied  by  supplier  SI. 
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•  (find  (jnani«-  Jnun  jnum-  tnum)  t1) 

(con«ol«  aortar) 

;  Get  S|  values  for  auppllera  who  aupply  both  projects 
:  J1  ond  J2. 

•  (find  (snuifr-  Jnuis)  j1  (snun-  jnum)  j2) 

(s3  s2) 

;  Get  the  names  of  the  suppliers  who  supply  project  J1 
;  with  a  red  port. 

•  (find  (snome-  snum  snum-)  (find  jnum  j1  (pnum  pnum- 

color)  red)) 

(Smith) 

:  Get  S#  volues  for  suppliers  who  supply  a  London  or  Paris 
;  project  with  a  red  part. 

•  (find  snum-  (find  (jnum  jnum-  city)  (London  Poris) 

(pnum  pnum-  color)  red)) 

(s4  si) 

;  Get  P#  values  for  parts  supplied  to  any  project  by 
;  a  supplier  in  the  some  city. 

•  (find  pnum-  (find  (jnum  jnum-  city)  ?city  (snum  snum- 

city)  ?city)) 

(p5  p4  pi  p2  p6  p3) 

;  Get  Jf  values  for  projects  not  supplied  with  any  red  part 
:  by  ony  London  supplier.* 

•  ((find  jnum-  ?x)-(find  jnum-  (find  (pnum  pnum-  color)  red 

(snum  snum-  city)  London))) 

(j6  jS  j2) 

,  Get  Sf  values  for  suppliers  supplying  at  least  one  port 
;  supplied  by  ot  leost  one  supplier  who  supplies  at  least 
:  one  red  part. 

•  (find  (snum—  pnum  pnum—  snum  snum—  pnum  pnum—  color)  red) 
(s3  s4  s2  s5  si) 

;  Get  J|  values  for  projects  which  use  only  ports  which  ore 
ovailoble  from  supplier  SI. 

•  ((find  jnum-  (find  qty  7q)) 

—  (find  (jnum-  pnum)  (find  pnum—  ?r)  —  (find  (pnun>—  snum) 

«1))) 

n  i  I 


11.4.2.  Address  recognition  for  mail  sorting 

A  research  group  led  by  Sargur  N.  Srihari  is  studying 
address  recognition  techniques  for  automated  mail  sorting 
(Srihari.  Sargur,  Jonathan.  Palumbo.  Niyogi.  and  Wang.  1985). 
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Computer  determination  of  the  sort-destination  of  an  arbitrary 
piece  of  letter-mail  from  its  visual  image  is  a  problem  that 
remains  far  from  solved.  It  involves  overcoming  several 
sources  of  ambiguity  at  both  the  spatio-visual  and  linguistic 
levels:  The  location  of  the  destination  address  has  to  be 

determined  in  the  presence  of  other  text  and  graphics;  relevant 
address  lines  have  to  be  isolated  when  there  are  irrelevant  lines 
of  text  in«-the  address  block:  the  iconic  shapes  of  characters 
have  to  be  classified  into  words  of  text  when  numerous  types 
of  fonts,  sizes,  and  printing  media  are  present:  and  the 
recognized  words  have  to  be  verified  as  having  the  syntax  and 
semantics  of  an  address. 

Spatial  relationships  between  objects  are  essential 
knowledge  sources  for  vision  systems.  This  source  extends 
naturally  to  the  postal-image  understanding  problem,  because  of 
strong  directional  expectations.  For  example,  the  postage  mark  is 
usually  above  and  to  the  right  of  the  destination  address,  and 
the  return  address  is  usually  to  the  left  of  the  postage.  A 
semantic  network  is  a  natural  representation  for  geometric 
relations. 

An  envelope  image  is  segmented  into  blocks,  and  a  SNePS 
network  is  built  that  represents  the  geometric  relations  between 
blocks  and  information  about  the  relative  and  absolute  area 
occupied  by  each  block,  A  preliminary  set  of  geometric 
relations  are  the  eight  compass  points.  Relative  area  occupancy 
is  expressed  as  the  percentage  of  each  block  that  falls  in  each 
of  nine  equal  rectangular  subdivisions  of  the  envelope  image, 
and  absolute  area  is  given  in  terms  of  the  number  of  pixels 
covered  by  each  block.  The  program  constructs  an  exhaustive 
representation  of  all  the  geometric  relations  present  in  the 
image.  Given  the  image  produced  by  an  initial  segmentation 
procedure,  a  rough,  intuitive  output,  shown  in  Figure  11-10 
with  some  arc  labels  removed  for  clarity)  was  produced. 

Future  work  in  this  area  includes  refinement  of  the  data 
structure  to  represent  more  information  more  efficiently  and 
the  addition  of  inferencing  capabilities  whose  objective  is  to 
present  the  control  structure  with  tentative  decisions  about  the 
address  block  based  only  on  the  information  provided  by  the 
initial  segmentation. 
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Figiire  ll-lCh  SNePS  Network  Representation  of  Initial 

Segmentation  of  Envelope  Image 
(from  Srihari.  Hull  et  al.  1985) 


11.43.  NEUREX 

The  NEUREX  project  (Cheng.  1984).  (Xiang  and  Srihari. 
1985),  (Xiang,  Srihari.  Shapiro  and  Chutkow.  984"),  (Suchin. 
1985)  is  a  diagnostic  expert  system  for  diseases  of  the  central 
and  peripheral  nervous  systems:  it  also  deals  with  information 
about  neuroaffectors.  neuroreceptors,  and  body  parts.  SNePS  is 
used  to  represent  spatial  structures  and  functions 
propositionally.  Entities  are  represented  topologically  by  means 
of  proposition  nodes  expressing  an  entity's  shape,  position,  etc., 
and  spatial  relations  are  represented  by  proposition  nodes 
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expressing  adjacency,  conneciiviiy.  direction,  etc.  This  approach 
integrates  structural  and  functional  neuroanatomical  information. 
Moreover,  the  representation  is  both  propositional  and  analog. 
For  the  j)eripheral  nervous  system,  there  are  nodes  representing 
such  propositions  as  that,  for  example,  a  sequence  of  nerve 
segments  are  linked  at  junctions,  and  that  the  whole  sequence 
forms  a  (peripheral)  nerve:  the  network  that  is  built  is  itself 
an  analog  representation  of  this  nerve  (and  ultimately,  together 
with  its  neighbors,  of  the  entire  peripheral  nervous  system).  See 
Chapter  15  for  further  discussion  of  analog  representations.  For 
the  central  nervous  system,  there  are  coordinates  in  the 
network  representation  that  can  be  used  to  support  reasoning 
by  geometrical  computation  or  graphical  interfaces. 

As  one  example,  the  network  of  Figure  11-11  can  be  used 
by  the  system  to  determine  which  muscles  are  involved  in 
shoulder-joint  flexion,  using  the  SNePS  User  Language  request 

(find  (ms-  cn)  (find  jt  shoulder-joint  mv  flexion)), 

which  returns  the  following  list  of  four  nodes: 

(deltoid  pec tora I i 3_ma j or_c I ov i cu I Qr_heod 

cor ocobroch i a  1 i s  biceps_brochi i ^ 

Furthermore,  rules,  like  that  shown  in  Figure  11-12.  can  be 
employed  and  can  even  include  probabilistic  information.  (Note 
that  node  r  in  Figure  11-12  is  the  SNePS  implementation  of 
the  IF-THEN  rule:  cf.  (SR. 13).) 


11.4.4.  Representing  visual  knowledge 

The  goal  of  the  Versatile  Maintenance  Expert  System 
(VMES)  project  is  to  develop  an  expert  maintenance  system 
that  can  reason  about  digital  circuits  represented  graphically 
(cf.  (Shapiro.  Srihari.  Geller.  and  Taie.  1986:SSTGS6)).  A 
similar  perspective  on  the  need  for  visual  knowledge 
representation  is  taken  by  Tsotsos  and  Shibahara  (Chapter  10) 
and  Havens  and  Mackworth  (Chapter  16).  The  representation 
is  not  pixel-oriented:  this  is  a  project  in  visual  knowledge 
representation  integrated  with  more  traditional  conceptual  and 
propositional  knowledge  repre.sentation.  The  graphical  form  of 
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MCtocalls  cocaeobcaehlal  is  bleeps  bcschii 


Four  of  the  shoulder-joint  movements  with  musscles  involved 
and  their  contribution  to  each  relevant  movement.  (Meaning 
of  the  arc  labels:  jt-joint;  mv-movement;  ms-muscle; 
cn-contribute;  pr-percentage.)  (From  Xiang  and  Srihari  1985) 

an  object  is  a  LISP  function  that,  when  evaluated,  draws  the 
object  on  the  screen.  Propositional  nodes  express  information 
about  (l)  the  relative  or  absolute  position  of  the  object  and  (2) 
attributes  of  the  object.  Visual  knowledge  can  also  be 
distributed  among  nodes  in  traditional  hierarchies:  for  example, 
the  knowledge  of  how  to  display  a  particular  hammer  may  be 
stored  at  the  level  of  the  class  of  hammers:  the  knowledge  of 
how  to  display  a  person  may  be  distributed  among  the  nodes 
for  heads,  arms.  etc. 

For  example.  Figure  11-13  shows  a  set  of  three  assertions. 
Node  m233  represents  the  assertion  that  the  object 
TRIANGLE-1  is  100  units  to  the  right  and  20  units  below  the 
object  SQUARE-1.  The  MODALITY  arc  permits  the  selection  of 
different  modes  of  display;  here,  we  want  to  display 
TRIANGLE-1  in  "functional"  mode.  Node  m220  states  that 
every  member  of  the  class  TTIIANGLE  displayed  in  functional 
mode  has  the  form  DTRIANG  associated  with  it.  Finally,  node 
m219  asserts  that  TRIANGLE-1  is  a  TRIANGLE. 
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Figure  11-12:  SNePS  Network  for  a  NEUREX  Rule. 

(From  Xiang  and  Srihari  1985) 


Figure  11-13:  SNePS  Network  in  VMES  for  the  Form 
and  Relative  Position  of  TRIANGLE-1. 


Figure  11-14  contains  four  assertions,  of  which  node 
m246  is  the  most  complex.  It  links  the  object  GATE-1  to  an 
absolute  position  at  100/400  and  to  the  class  of  all  AND-gales. 
Node  m244  asserts  that  GATE-1  is  a  part  of  BOARD-1.  Node 
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m248  asserts  that  INPl-GATEl  is  a  PART-OF  GATE-1  and 
belongs  to  the  class  AINPl.  The  label  'PART'  actually  stands 
for  "has  part".  Node  in239  links  the  attribute  BAD  to  GATE-1. 
Every  attribute  belongs  to  an  attribute  class,  and  the  arc 
ATTRIBUTE-CLASS  points  to  the  class  STATE. 


Figure  11-14: 


SNePS  Network  in  VMES  for  the  Location. 
Structure,  and  State  of  GATE-1. 


11.4.5.  SNcBR:  A  belief  revision  package 

The  SNePS  Inference  Package  has  been  extended  by  Joao 
Martins  to  handle  belief  revision  -  an  area  of  AI  research 
concerned  with  the  issues  of  revising  sets  of  beliefs  when  a 
contradiction  is  found  in  a  reasoning  system.  Research  topics 
in  belief  revision  include  the  study  of  the  representation  of 
beliefs,  in  particular  bow  to  represent  the  notion  of  belief 
dependence:  the  development  of  methods  for  selecting  the  subset 
of  beliefs  responsible  for  contradictions;  and  the  development  of 
techniques  to  remove  some  subset  of  beliefs  from  the  original 
set  of  beliefs.  (For  an  overview  of  the  field,  see  (Martins. 
1987).) 

SNeBR  iSNePS  .Belief  Revision)  is  an  implementation  in 
SNePS  of  an  abstract  belief  revision  system  called  the  Multiple 
Belief  Reasoner  (MBR).  which,  in  turn,  is  based  on  a  relevance 
logic  system  called  SWM  (after  Shapiro.  Wand,  and  Martins) 
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(Shapiro  and  Wand.  1976).  (Martins.  1983b).  (Marlins.  1983a). 
(Marlins  and  Shapiro.  1984).  (Martins  and  Shapiro.  1986a). 
(Marlins  and  Shapiro.  1986b).  (Marlins  and  Shapiro.  1986c). 
SWM  contains  the  rules  of  inference  of  MBR  and  defines  how 
contradictions  are  handled.  The  only  aspect  of  SWM  relevant 
to  this  description  concerns  the  objects  with  which  MBR  deals, 
called  supported  wffs.  They  are  of  the  form 

A  i  r,  o,  r 

where  A  is  a  well-formed  formula  representing  a  proposition,  t 
is  an  origin  tag  indicating  how  A  was  obtained  (for  example, 
as  a  hypothesis  or  as  a  derived  proposition),  o  is  an  origin  set 
containing  all  and  only  the  hypotheses  used  to  derive  A.  and  r 
is  a  restriction  set  containing  information  about  contradictions 
known  to  involve  the  hypotheses  in  o.  The  triple  t,  o,  r  is 
called  the  support  of  the  wff  A.  The  origin  tag.  origin  set. 
and  restriction  set  of  a  wff  are  computed  when  the  wff  is 
derived,  and  its  restriction  set  may  be  updated  when 
contradictions  are  discovered. 

f 

MBR  uses  the  concepts  of  context  and  belief  space.  A 
context  IS  any  set  of  hypotheses.  A  context  determines  a  belief 

space,  which  is  the  set  of  all  the  hypotheses  defining  the 

context  together  with  all  propositions  derived  exclusively  from 
them.  The  propositions  in  the  belief  space  defined  by  a  given 
context  are  characterized  by  having  an  origin  set  that  is 
contained  in  the  context.  At  any  point,  the  set  of  all 

hypotheses  under  consideration  is  called  the  current  context, 
which  defines  the  current  belief  space.  The  only  propositions 
that  are  retrievable  at  a  given  time  are  the  ones  belonging  to 
the  current  belief  space. 

A  contradiction  may  be  detected  either  because  an 

assertion  is  derived  that  is  the  negation  of  an  assertion  already 
in  the  network,  or  because  believed  assertions  invalidate  a  rule 
being  used  (particularly  an  AND-OR  or  a  THRESH  rule;  see 
(SR/SI.lO-ll)).  In  the  former  case,  the  contradiction  is  noted 
when  the  new,  contradictory,  assertion  is  about  to  be  built  into 
the  network,  since  the  Uniqueness  Principle  guarantees  that  the 
contradictory  assertions  will  share  network  structure.  In  the 
latter  case,  the  contradiction  is  noted  in  the  cour.se  of  applying 
the  rule.  In  the  former  ca.se.  it  may  be  that  the  contradictory 
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assertions  are  in  different  belief  spaces  (only  the  new  one  being 
in  the  current  belief  space).  If  so.  the  restriction  sets  are 
updated  to  reflect  the  contradictory  sets  of  hypotheses,  and 
nothing  else  happens.  If  the  contradictory  assertions  are  both 
in  the  current  belief  space  (which  will  be  the  case  when  one 
of  them  is  a  rule  being  used),  then,  besides  updating  the 
restriction  sets,  the  user  will  be  asked  to  delete  at  least  one  of 
the  hypotheses  underlying  the  contradiction  from  the  current 
context.  Management  of  origin  sets  according  to  SWM 
guaraiitees  that,  as  long  as  the  current  context  was  originally 
not  known  to  be  contradictory,  removal  of  any  one  of  the 
hypotheses  in  the  union  of  the  origin  sets  of  the  contradictory 
assertions  from  the  current  context  will  restore  the  current 
context  to  the  state  of  not  being  known  to  be  inconsistent. 


1 1.5.  Knowledge-based  natural  language 
understanding 

Jeannette  Neal  has  developed  an  AI  system  that  can  treat 
knowledge  of  its  own  language  as  its  discourse  domain.  (Neal. 
1985).  The  system's  linguistic  knowledge  is  represented 
deciaratively  in  its  network  knowledge  base  in  such  a  way  that 
it  can  be  used  in  the  dual  role  of  "program"  to  analyze 
language  input  to  the  system  and  "data"  to  be  queried  or 
reasoned  about.  Since  language  forms  (pari  of)  its  domain  of 
discourse,  the  system  is  also  able  to  learn  from  the  discourse 
by  being  given  instruction  in  the  processing  and  understanding 
of  language.  As  the  system's  language  knowledge  is  e.xpanded 
beyond  a  primitive  kernel  language,  instructions  can  be 

expressed  in  an  increasingly  sophisticated  subset  of  the  language 
tx?ing  taught  Thus,  the  system's  language  is  used  as  its  own 
metalanguage. 

The  kernel  language  consists  of  a  relatively  small 
collection  of  predefined  terms  and  rewrite  rules  for  expressing 
synux  and  for  expressing  the  mapping  of  surface  strings  to  the 
representation  of  their  interpretations. 

The  knowledge  representations  include  representations  for 
surface  strings  and  for  relations  .such  as:  (a)  a  lexeme  being  a 
nvmtx’r  of  a  certain  lexical  category,  (b)  bou:Kie<l  string  D 
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being  in  category  C  and  this  phrase  structure  being  represented 
by  concept  N.  (c)  a  structure  or  parsed  string  expressing  a 
certain  concept,  and  (d)  one  phrase  structure  being  a 
constituent  of  another  structure. 

In  order  to  talk  about  both  the  syntax  and  semantics  of 
language,  the  network  representations  distinguish  between  a 
word  or  string  and  its  interpretation.  In  one  experiment,  the 
statements 

(1)  A  WOMAN  IS  A  human 

(2)  -woman*  is  singular 

were  input  to  the  system.  The  first  makes  a  claim  about 
women;  the  second  makes  a  claim  about  the  word  'woman'. 
Nodes  m40  and  m50  of  Figure  11-15.  respectively,  represent 
the  propositions  expressed  by  these  statements.  The  concept  or 
class  expressed  by  ’WOMAN’  is  represented  by  node  b22:  the 
entity  represented  by  node  b22  is  a  participant  in  the  subset- 
superset  proposition  expressed  by  (l).  However,  in  the 
representation  of  (2).  the  word  ’WOMAN’  itself  is  the  entity 
having  the  property  SINGULAR. 

Additional  statements,  such  as: 

(R)  IF  THE  head-noun  OF  A  NOUN-PHRASE  X 

HAS  NUMBER  Y.  THEN  X  HAS  NUMBER  Y. 

were  input  to  the  system  to  demonstrate  the  use  of  a  subset 
of  English  as  its  own  metalanguage  in  building  up  the  system  s 
language  ability  from  its  pri.milive  predefined  language.  Figure 
11-16  illustrates  the  representation  of  the  systems 
interpretation  of  rule  (R)  as  well  as  the  representation  of 
certain  linguistic  relations.  Node  m87  represents  the  proposition 
that  some  bounded  string  represented  by  variable  node  vm  is  in 
the  category  HEAD-NOU.N.  and  this  phrase  structure  i.s 
represented  by  variable  node  v3.  Node  mSS  represents  that  the 
phrase  structure  represented  by  node  v3  is  a  constituent  of  vl. 
which  represents  a  NOUN-PHRASE  structure.  (In  this  figure, 
the  AVB  arcs  have  been  eliminated  for  clarity:  cf.  (SR/S1.16).) 
Ar.  soon  as  a.ny  rule  su^h  as  (R)  is  parsed  and  interpreted,  it 
IS  immediately  available  for  use  in  subsequent  processing. 
Thus,  the  .sys-tem  is  continuously  edvitable  and  car.  use  it.s 
ia rouag'’  a::  iis  ow'n  metalangu.i;’'- 
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Figure  11-15:  Representation  of  the  Interpretation  of  « 

Statements  About  Linguistic  and  Non- 
linguistic  Entities. 


11^.1.  Temporal  structure  of  narrative 

Michael  Almeida  is  using  SNePS  in  the  development  of  a 
system  that  will  be  able  to  read  a  simple  narrative  text  and 
construct  a  model  of  its  temporal  structure  (Almeida  and 
Shapiro,  1983).  (Almeida.  1986).  This  project  uses  an  event- 
based.  rather  than  a  proposition-based,  approach;  that  is. 
intervals  and  points  of  time  are  associated  with  evenLs 
represented  as  objects  m  the  network  rather  than  with  the 
propositions  that  describe  them  The  temporal  model  itself 
consists  of  these  intervals  and  points  of  time  related  to  one 
another  by  such  relation.';  as  BHl'OR  I'/AITFR . 
f 3 1  i R I .\G/0 ) \'I’ A 1  \S.  etc . 
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Figure  11-17:  SNePS  .\eiwork  for  a  Short  Narrative. 

Subscripts  are  used  in  the  figure  to  show  the  successive  values 
of  NOW. 

The  BEFORE-AFTER-DURATION  case  frame  is  used  to 
indicate  that  the  period  of  time  pointed  to  by  the  BEFORE-arc 
temporally  precedes  the  period  of  time  pointed  to  by  the 
AFTER-arc  by  the  length  of  time  pointed  to  by  the 
DURATION-arc.  These  durations  are  usually  not  known 
precisely.  The  value  <epsilon>  stands  for  a  very  short 
interval;  whenever  an  event  occurs  in  the  narrative  line,  it  has 
the  effect  of  moving  NOW  an  interval  of  <epsilon>  beyond 

It , 
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The  DURING-CONTAINS  case  frame  is  used  to  indicate 
that  the  period  of  time  pointed  to  by  the  DURING-arc  is 
during  (or  contained  in)  the  period  of  time  pointed  to  by  the 
CONTAINS-arc.  Notice  that  the  progressive  sentence.  "The  sun 
was  setting",  created  an  event  that  contains  the  then-current 
NOW.  If  the  system  knows  about  such  things  as  sunsets,  then 
it  should  infer  that  the  event  of  the  sun's  setting  also  contains 
John's  arrival,  his  ringing  of  the  bell,  and  probably  also  Mary's 
opening  of  the  door. 

11.6.  Conclusion:  SNePS  and  SNePS/CASSIE  as 
Semantic  Networks 

We  shall  conclude  by  looking  at  SNePS  and 
SNePS/CASSIE  from  the  perspective  of  Brachman's  discussions 
of  structured  inheritance  networks  such  as  KL-One  and 
hierarchies  of  semantic  network  formalisms  (Brachman.  1977. 
Brachman.  1979).  . 

11.6.1.  Criteria  for  semantic  networks 

Brachman  offers  six  criteria  for  semantic  networks: 

A  semantic  network  must  have  a  uniform  notation.  SNePS 
provides  some  uniform  notation  with  its  built-in  arc  labels  for 
rules,  and  it  provides  a  uniform  procedure  for  users  to  choose 
their  own  notation. 

,A  semantic  network  must  have  an  alporit/im  for  encoding 
information.  This  is  provided  for  by  the  interlace,''  to  SNePS. 
fnr  e.xample.  by  the  parser  component  of  our  ATN  parser- 
generator  that  takes  English  sentences  as  input  and  produces 
SNAPS  networks  as  output. 

A  semantic  network  must  have  an  assimilation 
mechanism  for  building  new  information  in  terms  of  stored 
information.  SNePS  provides  for  this  by  the  Uniqueness 
Principle,  which  enforces  node  sharing  during  network  building. 
The  assimilation  is  demonstrated  by  the  generator  component  of 
our  ATN  parser-generator,  which  takes  .SNePS  nodes  as  input 
and  produces  English  output  expressing  those  nodes:  Our 

:  rv. 'T  ition  with  CASSIP  illustrated  tfm  t.'ie  is'de  built  to 
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represent  the  new  fact.  "Lucy  is  sweet",  is  expressed  in  terms 
of  the  already  existing  node  for  Lucy  (who  had  previously 
been  described  as  young)  by  "young  Lucy  is  sweet*. 

A  semantic  network  should  be  neutral  with  respect  to 
network  formalisms  at  higher  levels  in  the  Brachman  hierarchy. 
SNePS  is  a  semantic  network  at  the  "logical"  level,  whereas 
SNePS/CASSIE  is  at  the  "conceptual"  level.  SNePS  is  neutral  in 
the  relevant  sense;  it  is  not  so  clear  whether  SNePS/CASSIE  is. 
But  neutrality  at  higher  levels  may  not  be  so  important:  a 
more  important  issue  is  the  reasons  why  one  formalism  should 
be  chosen  over  another.  Several  possible  criteria  that  a 
researcher  might  consider  are:  efficiency  (including  the  ease  of 
interfacing  with  other  modules;  for  example,  our  ATN  parser- 
generator  has  been  designed  for  direct  interfacing  with  SNePS). 
psychological  adequacy  (irrelevant  for  SNePS.  but  precisely  what 
SNePS/CASSrc  is:  being  designed  for),  ontological  adeqxmcy 
(irrelevant  for  SNePS/CASSIE-see  below),  logical  adequacy 
(guaranteed  for  SNePS.  because  of  its  inference  package),  and 
natural  language  adequacy  (a  feature  of  SNePS’s  interface  with 
the  ATN  grammar). 

A  semantic  network  should  be  adequate  for  any  higher- 
level  network  formalism.  SNePS  meets  this  nicely:  KL-One 
can  be  implemented  in  SNePS  (Tranch.  1982). 

A  semantic  network  should  have  a  semantics.  We 
presented  that  in  Section  11.3.  But  ii  should  be  observed  that 
there  are  at  least  two  very  different  sorts  of  semantics.  In 
SNePS.  nodes  have  a  meaning  within  the  system  in  terms  of 
their  links  to  other  nodes;  they  have  a  meaning  for  users  as 
provided  by  the  nodes  at  the  heads  of  LEX  arcs.  Arcs,  on  the 
other  hand,  only  have  meaning  within  the  system,  provided  by 
node-  and  path-based  inference  rules  (which  can  be  thought  of 
as  procedures  that  operate  on  the  arcs).  In  both  cases,  there  is 
an  "internal",  system  semantics  that  is  holistic  and  structural; 
the  meaning  of  the  nodes  and  arcs  are  not  given  in  isolation, 
but  in  terms  of  the  entire  network.  This  sort  of  "syntactic" 
.semantics  differs  from  a  semantics  that  provides  links  to  an 
external  interpreting  system,  such  as  a  user  or  the  "world"  - 
that  is.  links  between  the  network  s  way  of  repre.senting 
information  and  the  user's  way.  Ii  is  the  latter  .son  of 
semantics  that  we  provided  for  SNePS/(L4SSI [;  with  respect  to 
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an  ontology  of  Meinongian  objects. 

11.6^.  SNePS  and  SNePS/CASSIE  vs.  KL-One 

SNePS  and  SNePS/CASSIE  can  be  compared  directly  to 
KL-One.  Unlike  KL-One.  which  is  an  inheritance-net'voTk 
formalism  for  representing  concepts,  instances  of  concepts,  and 
properties  and  relations  among  them.  SNePS  is  a 
propositioncd.-ntv*/ ork  formalism  for  representing  propositions 
and  their  constituents  (individuals,  properties,  and  relations). 

Nevertheless.  SNePS  can  handle  inheritance.  We  have 
already  seen  an  example  of  inheritance  by  pafh-based  inference 
in  the  conversation  with  CASSIE.  In  that  example,  inheritance 
could  also  have  been  accomplished  through  node-based  inference 
by.  for  example,  representing  ‘dogs  are  animals’  as  a 
universally-quantified  rule  rather  than  by  a  SUBCLASS- 
SUPERCLASS  case  frame.  That  is,  where  an  inheritance 
network  might  express  the  claim  that  dogs  are  animals  by  a 

single  arc  (say.  a  subclass-arc)  from  a  dog-node  to  an  animal- 
node.  SNePS  could  express  it  by  a  proposition  (represented  by 
node  ml 7  in  Figure  11-5.). 

One  advantage  of  the  propositional  mode  of  representation 
and,  consequently,  of  the  second,  or  ni/e-based.  form  of 
property  inheritance  is  that  the  proposition  (ml7)  expressing 
the  relationship  can  then  become  the  objective  of  a  proposition 
representing  an  agent's  belief  or  it  can  become  the  antecedent 

or  consequent  of  a  node-based  rule.  In  some  inheritance 
networks,  this  could  only  be  done  by  choosing  to  represent  the 
entire  claim  by  either  the  dog-node,  the  animal-node,  the 
subclass-arc.  or  (perhaps)  the  entire  structure  consisting  of  the 
two  nodes  and  the  arc.  The  first  two  options  seem  incorrect; 

the  third  and  fourth  either  introduce  an  anomaly  into  the 

representation  (since  arcs  can  then  point  either  to  nodes  or  to 
other  arcs  or  to  structures),  or  it  reduces  to  what  SNePS  does; 
SNePS.  in  effect,  trades  in  the  single  arc  for  a  node  with  two 
outgoing  arcs.  In  this  ’-''ay.  the  arcs  of  inheritance  networks 
become  information-bearing  nodes,  and  the  semantic  network 
svstem  become.s  a  propositional  one. 

.Secomi,  Kl.-One  use;,  "epistemological]'.’  primitive  links". 
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Bui  why  does  KL-One  use  the  particular  set  of  links  thai  ii 
does,  and  noi  some  other  set:  that  is.  what  is  the  ontological 
justification  for  KL-One’s  links?  There  have  been  many 
philosophical  and  logical  theories  of  the  relations  of  the  One  to 
the  Many  (part-whole,  member-sei-superset.  instance-concept, 
individual-species-genus.  object-Platonic  Form.  etc.).  KL-Onc's 
only  motivation  seems  to  be  as  a  computationally  efficient 
theory  that  clarifies  the  nature  of  inheritance  networks:  but  it 
does  not  pretend  to  ontological  or  psychological  adequacy. 
Indeed,  it  raises  almost  as  many  questions  as  it  hopes  to 
answer.  For  example,  in  KL-One.  instances  of  a  general  concept 
seem  to  consist  of  instances  of  the  attributes  of  the  general 
concept,  each  of  which  instances  have  instances  of  the  values 
of  those  attributes.  But  this  begs  important  philosophical 
questions  about  the  relations  between  properties  of  concepts  (or 
of  Forms,  or  of  ...)  and  properties  of  individuals  falling  under 
those  concepts  (or  participating  in  those  Forms,  or  ...:  some  of 
these  issues  are  discussed  in  (Brachman.  1983).  but  not  from  a 
philosophical  point  of  view):  Are  they  the  same  properties? 
Are  the  latter  "instances"  of  the  former?  Are  there  such  things 
as  concepts  (or  Forms,  or  ...)  of  properties?  And  do  instance 
nodes  represent  individuals?  Do  they  represent  individual 
concepts?  [cf.  (Brachman.  1977):  148.] 

Now.  on  the  one  hand.  SNePS/CASSlE's  arcs  are  also 
taken  to  be  "primitive":  but  they  are  justified  by  the 

Meinongian  philosophy  of  mind  briefly  sketched  out  above  and 
explored  in  depth  in  the  references  cited.  On  the  other  hand. 
SNePS’s  arcs,  by  contrast  to  both  S.NePS/CASSlE's  and  KL- 
One  s.  are  not  restricted  to  anv  particular  set  of  primitives 
e  believe  that  the  interpretation  of  a  particular  use  of  SNePS 
depends  on  the  user's  world-view;  the  user  should  not  be 
required  to  coniorm  to  ours. 

.And,  unlike  KL-One.  the  entities  in  the  ontology  for 

SNePS/CASSIE  are  not  to  be  taken  as  representing  things  in  the 
w'orld:  SNePS/CASSlE’s  ontology  is  an  epistemological  ontology 

(cf-  (Rapaport.  19S5a).  (Rapaport.  19S5b).  (l^apaport.  I986d)] 
consisting  of  the  purely  intensional  items  that  enable  a 
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cognitive  agent  to  have  beliefs  (about  the  world).  An 
epistemological  ontology  is  a  theory  of  what  there  must  be  in 
order  for  a  cognitive  agent  to  have  beliefs  (about  what  there 

is). 


♦ 
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